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The Theme of This Issue 

By Claudia Lynch, Benchmarks Editor (BITNET: AS04<2>uvrvMi) 

L\ s you have probably already surmised, the theme of the April 1991 issue of 
X benchmarks is “Thesis and Dissertation Production." This is a very broad 
topic, and it is by no means covered exhaustively within these pages. We have tried 
to present various aspects of the process of “doing" a thesis or dissertation We hope 
. 1 you W|M S ct somc S 0 ™ 1 ‘dcas from this issue, whether you arc actively engaged 
m producing a thesis or dissertation or just involved with ongoing scholarly research. 

The first article, “Preparing Your Thesis or Dissertation With a Microcomputer” 
outlines the process one might go through in planning and producing such 'a 
document There are four articles that discuss various sources for available data to 
aid in the research process. Two more articles discuss statistical tools to aid in data 
analysis. Rounding out the ‘Thesis and Dissertation Production” portion of the 
newsletter is an article Irom the University of Colorado at Boulder. This article “The 
D'gital Dissertation: A Better Answer for the Future," was included as a “thought 
piece. Perhaps you will read it and suddenly find yourself thinking about your 
research and the research and publication process in a whole new light. We hope 
so. As always, if you have any questions or comments about the topics presented 
here, contact Academic Computing Services, ISB 119 (565-2324). ■ 
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SERVICES AVAILABLE TO USERS OF THE 
UNT COMPUTING FACILITIES_ 


The UNT Computing Center is located in the Information 
Sciences Building(ISB), Room 119. Phone Numbers: 


Computing Center: 
HelpDesk: 

Micro Support: 
Graphics Lab: 

ISB I/O Area: 

BA I/O Area: 


(817)565-2324 
(817)565-4050 
(817)565-2316.565-2319 
(817)565-3479 
(817)565-3890 
on i/unn». (817)565-2350 

All personnel listed below can be contacted either by calling 
the Computing Center or by sendingthem electronic mm I on 
VM/CMS (ID-codes follow each name. All IDs are on m l- 
NETnodeUNTVMl). 

Benchmark* -ClaudiaLynchiAsoo 

Information & ID-Codes; Disk Space Problems, 

Passwords - Theresa Russell 

Statistical/Research Support - George Morrfcw (Ason, 
Panu Si tti wong (AC 0 «),Phani t Laosirirat iacio 
Academic AD ABAS/COM-PLETE - Cathy Hardy (AC66) 
CRSP & COMPUSTAT Problems - Panu Sitti wong <p.nu>, 
Phanit Laosirirat <ac44> 

Student Programming Problems - CSCI Dept., GAB 
Room 550; BCIS Dept., BA Room 152 

Problems with JCL. Operating Systems, or Com- 
munication/Tenninal Problems - Help Desk 

Data Entry;TestScoring&Analysis - Betty Grise 
Administrative Applications- Coy Hoggard 

Printout Retrieval-ISB or BA I/O Operators 


DIALING-UP UNT COMPUTERS OVER 
THE TELEPHONE 


Phone numbersforaccesBingUNTcomputing systems: 

300-2400 BAUD: (817)565-3300 

300/1200BAUD (817)565-3499 

300-9600 BAUD: (817)565-3461 

300-2400 BAUD: D/FW METRO 792-4140 

Arracode214mu«ldlal817before theMETROt. 
In your communications program, set Data Bits to 7, Pari ty to 

S and Stop Bits to 1. The dial-up numbers have an autobaud 
feature that requires you to hit the <RBTURN> key repeatedly 
once connection with the remote modem is made. This is so 
that the recei vingmodem can determine the appropriate baud 
rate When you have established a communications link, a 
prompt!# for non-metro numbers, UNTModem.> for themetro 
fines) will appear on your screen and you can enter one ol 


MetroLlnes 

UHTModsnu* 

Non-Metro Lines 

1 

System 

N/A 

CALL8040 

MUS1C/SP 

(llnecditingand 

PCWS) 

Connect VM3270 

CALL 3270 



»a ■).- *133^16 


Connect 8oI 

CALL 900 


Connect Ponder 

CALL 780 

EaSEHlHBBa 

Connect Library 

CALL 3000 

UNTIJbrarics- 
on-line card 
catalogue. 


riUUHs l-UH uNivcnai i i wr 

Location 

Computing Center RJE 

Days 

Sunday 

Monday-Saturday 

Times 

Noon-Midnight 

7 a.m. Mon.-Midnight Sat. 

(Open 24 hours/day) 

ISB 110 Terminal Area 

Sunday 

Monday-Thursday 

Friday 

Saturday 

1-11:50 p.m. 

8:00 a.m.-11:50 

8:00 a.m.- 8:50 p.m. 

9 a.m.-8:50 p.m. 

College of Business 

Sunday 

Monday-Thursday 

Friday, Saturday 

Noon - Midnight 

8:15 a.m.-11:45p.m. 

8:15 a.m.-7:45 p.m. 

GAB 550 

Sunday 

Monday-Thursd ay 

Friday 

Saturday 

2 p.m.-Midnight 

8 a.m.-Midnight 

8a.m.-3p.m. 

CLOSED 

Graphics Lab 

Sunday 

Monday-Thursday 

Friday 

Saturday 

1 p.m.-Midnight 

8 a.m.-Midnight 

8a.m.-9p.m. 

9a.m.-9p.m. 

Willis Library 

_-_ 

Sunday 

Monday-Thursday 

Friday 

Saturday 

1 p.m.- Midnight 

7:30 a.m.- Midnight 

7:30 a.m.-9p.m. 

9a.m.-9p.m. 


may be reproduced lor nonprofit purposes provided the publication and issue are lolly acknowledged. 






















Preparing Your Thesis or 
Dissertation With a 
Microcomputer 

By Parra Sitliwong, Academic Computing Consultant —"ABD” in Political Science (BITNET: 
PAN't V&IJNTVMI) 

T he process of preparing your thesis or dissertation involves, in general, three 
activities: 1 

O Preparing your thesis or dissertation prospectus. 

0 Carrying out your research and data analysis. 

0 Reporting the results and findings. 

This article will attempt to show the advantages of using a microcomputer in those 
activities. “HOW TO” instructions arc given when nccccssary. 

Preparing Your Prospectus 

In this stage, unless you arc planning a pilot study, you can use a microcomputer for 
word processing and a bibliographic and references database. Here WordPerfect 
version 5.1 is an excellent tool. Beyond the basic word processing functions and 
capabilities, WordPerfect provides a variety of special functions and capabilities. 
These include tables, graphics, equation composition, automatic table of contents 
and indices generation, spelling checker, on-line thesaurus, etc. In addition, third 
parly software is available that can be used with WordPerfect to customize your 
document. One example of such software is called Thesys. It is a program written 
in WordPerfect macros. This program will help you format your document in various 
styles including APA, Turabicn, etc. Another obvious advantage of a word process¬ 
ing program is its ability to reformat the document automatically as you write, 
rewrite, or rearrange your text. 

You also can use WordPerfect as a reference and bibliography compilation tool. 
With WordPerfect’s sort function, your references database can be arranged and 
organized in various formats as needed. For example, whim sorted by author’s last 
name, it then can be used as part of your references in your document 

Another product appropriate to use in organizing your reference materials is D-Base 
III. The example on page 4 shows a sample entry in a D-Base III file. 

Data Gathering and Analysis 

The advantages a microcomputer can provide at this stage of your thesis or disser¬ 
tation preparation are enormous. Most data that you want to use in your research can 
be arranged in a “Data Matrix” format where columns define data. For example, my 
dissertation requires biographical information of more than 100 Canadian Supreme 
Court variables or measuring items, and rows are subject. A spreadsheet or data- 


1 This article fol lows the same organization as Using Microcomputers in Research by Thomas 
WM. Madron, C.Neal Tate and Robert G. Brookshire (1985, Beverly Hills, CA: Sage 
Publications). It does not, however, try to do an extensive review of microcomputer software. 
The software covered is available to students in the Computing Center public access labs. 


base type program such as Lotas 1 -2-3 
or D-basc can be used to collect and 
organize the court justices. I ased D- 
Base III to compile this information. 
Since D base III files can be directly 
used in SPSS/PC+ and SAS/PC, I can 
easily carry out the data analysis (see 
“Converting Foreign Files to SAS PC 
and SPSS PC Data Sets” in the Septem¬ 
ber 1990 issue of Benchmarks for more 
information about converting D Base 
and Lotus 1-2-3 to SAS PC or SPSS 
PC+ file). 

Madron, Tate, and Brookshire (1985) 
point out several ways which 
microcomputers can be used at this 
stage of your rescrach. These include 
using microcomputer software for 
note-taking and organizing, ques- 
tionaire construction, automated sur¬ 
vey research, etc. 

Microcomputer statistical packages 
provide varieties of statistical proce¬ 
dures ranging from simple procedures 
— frequency distributions, cross- 
tabulations, univariate descriptions, 
etc. —to advanced procedures— 
analysis of variance, multiple regres¬ 
sion, factor analysis, multi-dimcnlional 
scaling, etc. In fact, some packages, 
SAS/PC and SPSS for die Macintosh 
and OS/2 provide procedures identical 
to their mainframe counterparts but 
with more user friendliness. SPSS/PC+ 
provides more than 90 percent of the 
procedures that arc available on the 
mainframe version of SPSS. 

There are several advantages in using 
statistical packages on the microcom¬ 
puter rather than on the mainframe sys¬ 
tem. 

0 First, microcomputer-based pack¬ 
ages provide, in general, a more 
user-friendly environment than the 
mainframe based package. For ex¬ 
ample, SPSS provides a command 
generator which you can use to 
select and paste commands into 
your program. This helps reduce the 
lime it takes to learn SPSS syntax, 
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Author: 

Hanushck. Eric A. and John E. Jackson 

Title: 

Statistical Methods for Social Scientists 

Journal: 

None 

Date: 

1977 

Publisher: 

Academic Press, New York 

Subject: 

Statistics 

Contend: 

Regression Analysis: Theory and Application 

Content2: 

Logit and Probit Analysis 

Contcnt3: 

Matrix Algebra 

D-Base III Reference File 


which in lum allows you to carry 
out your analyses faster. 

© Second you don’t have spend time 
learning the microcomputer’s 
operating system, since microcom¬ 
puter-based statistical packages re¬ 
quire little knowledge about such 
things. It is likely that you will have 
adequate knowledge of the operat¬ 
ing system if you use a microcom¬ 
puter from the beginning of the 
project. 

© Third and most important is the 
sharability of data among 
microcomputer-based software. 
For example, the output from a 
statistical program, both text and 
graphic, can be directly incor¬ 
porated into a word processing pro¬ 
gram. This will help reduce redun¬ 
dancy in preparing your document, 
and save you a lot of time and ener¬ 
gy- 

Statistical analysis based on 
microcomputers is not suitable for 
everyone, however. Microcomputers 
arc limitated as to the amount of disk 
storage and processing power that is 
available. If you are working with a 
large database, i.e., you have either a 
lot of observations or large number of 
variables, or both, you may not have 
enough hard disk space to store all your 
data. 

Your analyses also may require a lot of 
processing power which is not avail¬ 


able on your m icrocompulcr or if avai 1- 
ablc will take a long time processing. 
For example, a Monte Carlo study or 
complex simulation, which often needs 
hours of processing time on a 
mainframe or minicomputer, will 
probably run for days on microcom¬ 
puter. 

The Computing Center has SAS/PC, 
SPSS/PC + and SPSS for the Macin¬ 
tosh available in most of the 
microcomputer labs on campus. All of 
these programs contain advanced 
statistical procedures. They also have 
the ability to produce high resolution 
presentation graphics. 

Reporting the Results and 
Findings 

After data are collected and analysed, 
it’s time to put every thing together. 
Your word processing program, again, 
will play a central role. Careful plan¬ 
ning can help save your time in both 
compiling information and presenting 
it. As they say, word prosessing will not 
make bad writer into a good one, but it 
surely makes a document look good. A 
completed thesis or dissertation will go 
through several revisions before it is 
acceptable by your committee mem¬ 
bers. Revisions can be done easily with 
a word processing program. While the 
text is changed, some formatting such 
as page and chapter numbering, etc. 
will be updated. The ability to cut and 


paste text allows you to move or re-ar¬ 
range the document in vaious ways. It 
also allows you to re-use information 
from different sources. For example, a 
review of literature that is done for the 
proposal can be re-used again as part of 
the document. 

It is likely that your document will 
include some graphical presentation of 
the data. Both SAS/PC GRAPH and 
SPSS/PC+, used in conjunction with 
either Harvard Graphics, Microsoft 
Chart, Chart-Master, Draw Applause, 
or Graftalk, arc capable of generating 
graphic output files that arc recognized 
by WordPerfect version 5.1. Those for¬ 
mats include HPGL, Postscript, CGM, 
etc. You can use the SAS commands 
below to create HPGL output in the file 
called graft .hpg on the c:\graf direc¬ 
tory. 

FILENAME grafout ‘c:\grabgrafl.hpg’; 
GOPTIONS dcvice=hp7550a noprompt 
rotate=landscape gsfmode=replace 
gsfname=grafout; 

Proc gchart; 
etc.; 

When you request a graphics proce¬ 
dure in SPSS/PC+, the program will 
automatically launch the graphical 
program that is available to you. You 
will need to follow each packages' 
method of printing graphs to disk to 
save the needed output You need to 
select the output medium that is 
capable of generating output in the for¬ 
mat you want. For example, you can 
select any of the HP plotters if you want 
to produce the HPGL format file. 

To include graphs in WordPerfect 
select the Filename option when in¬ 
voking Graphics (Alt-F9). Word Per¬ 
fect will prompt for the filename. After 
you enter the filename and press 
<enter>, the graph will be loaded and 
become part of your document 

A good document must be free of any 
spelling or grammatical errors. Most 
microcomputer-based word process- 
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ing programs provide an on-line spell¬ 
ing checker. In addition, they also have 
an on-line thesaurus which allows you 
to select an alternate words for your 
document. 

GRAMMATIK is available in the 
public access labs maintained by the 
Computing Center. This software will 
check for grammatical errors and 
analyze your writing style. Although it 
is not a fool-proof document checker, 
you can use it as a primary checker for 
some obvious errors such as subject 
verb agreement, inconsistent 
capitalization, etc. 

Microcomputers are powerful research 
tools. Proper use of the software and 
hardware can help you produce a good 
looking thesis or dissertation. If you 
have any questions about the topics 
discussed in this article, contact 
Academic Computing Services (565- 
2324) and we will be glad to help 
you.B 


SPSS 4.1 Available for 
Testing on OS/MVS 

T he Computing Center has in¬ 
stalled the new release of 
SPSS on the HDS-8083 Academic 
Mainframe running under 
OS/MVS. This release will solve 
some problems that occurcd in 
SPSS4.0. If you want to execute 
this new version, you will need to 
change your EXEC card as fol¬ 
lows:// EXEC SPSS41 
Your SPSS program will run under 
the new release wi thout any furthcr 
modifications. 

Please inform Academic Comput¬ 
ing Services (565-2324) of any er¬ 
rors you find in this new release. 

If we do not encounter any difficul¬ 
ties, we will switch over from 
SPSS 4.0 to SPSS 4.1 during the 
semester break. ■ 


Using Secondary Data Sources 
for Your Thesis or Dissertation 

By Panu Sittiwong, Academic Computing Consultant —“ABD” in Political Science (BtTNirr- 
PAN! !@>UNTVM1) 

W hen preparing the prospectus for your thesis or dissertation, you may want 
to keep in mind some publicly available secondary data sources. You may 
be able to use these data as suppliment to your primary data source or as the main 
data source for your study. 1 

UNT has vast quantities of machine rcadblc data available to students and faculty 
members. Many of these data files arc maintained by the Computing Center. These 
data are acquired from several sources including the Inter-university Consortium for 
Political and Social Research (ICPSR), the Department of Labor, Center for Re¬ 
search in Security Prices (CRSP), Standard and Poors (COMPUSTAT), etc. In 
addition, through the Internet connection via VAX/VMS, researchers can search and 
obtain the Louis Harris and Associates data holdings at the Institute of Research in 
Social Science (IRSS) at the University of North Carolina, Chapel Hill. 

□ CRSP Data: Once a year UNT receives data from the Center for Research in 
Security Prices. The data include: 

Monthly NYSE and AMEX Returns and Master file; 

Daily NYSE and AMEX Returns file; 

Daily NASDAQ Returns and Master file; 

Daily, Monthly, Quaterly, and Annual Markets Indices file; and 

National Market System Securities file. 

□ COMPUSTAT II Data: Similar to the CRSP data, UNT receives updates of 
COMPUSTAT II data from Standard and Poors once a year. The current holdings 
include: 

Annual Primary. Supplementary, and Tertiary (Industrial and Research) file; 
Annua] Over the Counter file; 

Annual Bank file; and 

Annual Price, Dividends and Earnings file. 

In addition, we are in the process of acquiring some quarterly data. 

□ ICPSR Data: Data from ICPSR constitute the majority of the data archives 
maintained by the Computing Center at UNT. Currently, there are more than 200 
data titles available locally at UNT. As a member of the Consortium, UNT 
students and faculty members can request any data from ICPSR. Data available 
from ICPSR cover a wide range of subjects and disciplines including: 

Public Opinion Surveys 

Election Studies of the U.S. and of foreign countries 
Congressional Roll Calls 
General Social Surveys 


1 Related articles on COMPUSTAT II and CRSP follow this article, on pages 6 & 7. 
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Health Interview Survey 
Consumer Expenditure Survey 
Government Finance 
World Economic Indicators 
Population Surveys 

* Censuses for the U.S. and foreign countries 
0 El'RO-BAROMETER. etc. 

You can search and locate ICPSR data holdings available at UNT by issuing the 
command FINDICPSR from either the MUSIC/SP or VM/CMS prompt. If ihc 
data that you need are not available at UNT, you can request the data by contacting 
Panu Sitliwong at ext. 2324, or by sending electronic mail to either PANU on 
CMS or AC09 on MUSIC. It may take up to 4 weeks before the data can be 
accessed when it is ordered from ICPSR. 

□ I.ouls Harris & Associates Data: In addition to the above data, any researcher 
with access to the Internet can now search the Louis Harris and Associates data 
holdings at the Institute of Research in Social Science (IRSS) at the University 
of North Carolina, Chapel Hill. Keywords in combination can be used to locate 
items of interest. Soon the search will also display frequency distributions for 
each question retrieved. 

IRSS Data Services can be reached through the internet from the UNT 
VAXclustcr. The Internet address for UNCVM1 is: 

UNCVM1.ACS.LNC.EDU or 128.109.157.5 
After logging on the VAX, to connect to UNCVM1, type: 

TN3270 UNCVM1. ACS.UNC.EDU or TN3270 128.109.157 5 
When you connect to UNCVM1, you'll see the standard VM logon banner. At 
the logon line, type: 

IRSS1 

or 

IRSS2 

The system will prompt for a password. The password is IRSS. You will now be 
logged on to the Computer at University of North Carolina, Chapel Hill. The 
procedures should be self-explanatory from this point on. Please note that this 
data archive is available on a remote computer system. Hence, the Computing 
Center stall can only can assist you on a limited basis. 

Data from Other Sources 

In addition to those public data sources mentioned above, you can acquire data from 
various public and private agencies yourself. The Computing Center can handle data 
on many mediums. If you plan to acquire a secondary data source yourself, please 
consult with an Academic Computing Services consultant concerning the method 
and medium that is most appropriate to use in storing the data when it arrives here 
at UNT. ■ 


2 Currently, before using ihe FINDICPSR command on VM/CMS, you must first link to the 
ACAD 291 minidisk. 


COMPUSTATII 

By Phanit Laosirirat, Academic Computing 
Consultant —Political Science Doctoral 
Student (BrTNltT: PIIAN1T@UNTVM1) 

Introduction 

T he COMF1 ISTATII data service 
consists of a computer-readable 
library of financial, statistical, and 
market information covering several 
thousand industrial and nonindustrial 
companies. There arc more than three 
hundred variables available on the main 
industrial tape. The COMPUSTAT data 
service is used as a reference source for 
several publications. For example, it is 
the main source of Business Week 
magazine in ranking its most valuable 
companies each year. More specifical¬ 
ly, COMPUSTAT II constantly updates 
its information on a yearly basis on the 
largest and most significant New York 
and American Stock Exchange com¬ 
panies, as well as Firms trading over- 
the-counter. The coverage is extensive. 
A full range of fundamental data is 
provided, allowing for variety and 
flexibility of usage. Data offered in¬ 
cludes key income statements, balance 
sheets, changes in financial position, 
and market items. 

For convenience in accessing the 
COMPUSTAT data files, all the data 
have been converted to S AS datasets by 
Academic Computing Services. As a 
result, a SAS program is required for 
retrieving any of these data files (other¬ 
wise, FORTRAN or PL/1 programs are 
needed for the complex data array for¬ 
mat). Sample SAS programs will be 
provided later on in this article. 

It is extremely important that users of 
COMPUSTAT datasets consult with the 
COMPUSTAT manual for data items 
and their definitions. The manuals are 
available in the reference section in the 
main library. Call Numbers of the 
manuals as follows: 
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• COMPUSTAT II and RESEARCH 
and OTC COMPUSTAT 
(H 62 .C58483 1989 c.2) 

• PDE COMPUSTAT (H62 P35 
1986) 

• BANK COMPUSTAT (H62 .B32 
1986) 

Data Availability 

The COMPUSTAT II data library con¬ 
sists of several files; INDUSTRIAL 
FILES, a BANK FILE, and a PDE 
FILE. All files are in annual format 
with a 20 year span for each company 
covered. 


Industrial Files 

The Industrial files can be subdivided 
into 5 categories: 

• The Primary Industrial file (ap¬ 
proximately 800 companies) 
specifically includes all companies 
in the SAP Industrial Index, some 
companies in the SAP Utilities 
Index, the Transportation Index, and 
the SAP Financial Index, plus com- 
phnies'of grfcatest interest - primarily 
companies on the New York Slock 
Exchange. 

• The Supplementary Industrial file 

(approximately 800 companies) 
contains companies which on the 
major exchanges but which may 
have a lesser degree of investor in¬ 
terest. 

• The Tertiary file (approximately 
800 companies) completes the 
coverage of industrial companies 
with common stock listed on the 
New York and American Stock Ex¬ 
changes. It also includes ap¬ 
proximately 300 nonindustrial com¬ 
panies which have been modified for 
comparability to industrial com¬ 
panies. The nonindustrial com¬ 
panies arc from the following areas: 
Banks, Utilities, Life Insurance, 
Railroads, Property and Liability, 
and Real Estate Investment Trusts 
(REIT). These nonindustrial com¬ 


panies include some of the com¬ 
panies in the SAP Utilities Index, 
the SAP Transportation Index, and 
the SAP Financial index. 

O The Over-the-Counter file (ap¬ 
proximately 850 companies) con¬ 
tains those companies traded over- 
the-counter that command the 
greatest investor interest, i.e., num¬ 
ber of institutional holders, volume 
traded, price movement, earnings 
growth, regional and/or economic 
importance. 

© The Industrial Research file con¬ 
sists of companies which have been 
deleted from the COMPUSTAT II 
Primary, Supplementary, Tertiary, 
and Over-the-Counter files. Com¬ 
panies may be deleted from active 
files and placed on the research file 
for the following reasons: 

Acquisition or Merger 
Bankruptcy 
Leveraged Buyout 
Liquidation 

Reverse acquisition (from 
1983 forward) 

No longer fits original file for¬ 
mat (from 1978 forward) 

Now a private company 

Other (no longer files with 
SEC. etc.) 

Bank File 

This file has approximately 150 com¬ 
panies and contains data on leading 

United States banking institutions. It is 
available in annual formal with 20 
years of data for each bank. 

PDE File 

The Price-Dividends-Eamings (PDE) 
file contains key market information on 
approximately 7,000 companies and 
approximately 120 industry indexes 
and composites. Each data record con- 


-- -u-hiis. cacn uem contains 

12 months of data. 

Effective COMPUSTAT 
Usage 

Each company in the COMSPUSTAT 
files has several index variables which 
can be used to group companies 
together for effective data retrieval. The 
names of the index variables can be 
found in the Compustat II manual; Sec¬ 
tion 8-B, Appendix B and C in the 
library. The following list designates 
the various company groupings. 

O Group by File. In this case the File 
Identification Code (RLE variable) 
is the variable index. For instance, 
one can extract all the Primary file 
companies from the Total industrial 
file by writing a SAS program to 
select the cases that have a RLE 
variable equal to 01 and 11 . 
Likewise, Tertiary file companies 
can be extracted by checking for a 

® Group by Trading Exchange. The 
Exchange Listing Code (ZLIST 
variable) can be used to group com¬ 
panies by exchange. Thus all New 
York Exchange companies can be 
extracted from the Total Industrial 
file by checking the Exchange List¬ 
ing Code for a 1,2,3,4, and 10. 

® Group by Industry. All companies 
in a specific SIC industry group can 
be identified on the tapes by check¬ 
ing the industry field (DNUM vari¬ 
able) for the desired industry group 
or groups. Thus, all companies in the 
Metal Mining industry group can be 
extracted by checking for an in¬ 
dustry number of 1000. 

O Major S&P Index Companies. All 
companies in the SAP400 Index are 
carried on the Primary Industrial 
file, and there are two methods 
available for identifying these com¬ 
panies. Either the File Identification 
Code (RLE variable) can be check¬ 
ed for an 11, or the Exchange Listing 
Code (ZLIST variable) can be 
checked fora 1 ,11,and 15. 
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After a decision about which group of companies is to be retrieved, the next step is 
to determined the year or years of data to be used. It is highly recommended that 
users retrieve a subset of companies cither by major groupings or by year, instead 
of retrieving the entire data set because retrieval is more efficient and much faster 
that way. 

SAS Program Setup 

The following statements are the Job Control Language for retrieving the most recent 
COMPUSTAT data library files. 


Industrial Files 

/OSJE 

RFTIJRN 

lixXXXCOMP JOB (XXXX,:\5,2):YOUR—NAME'.Cl.NSS=A.PASS'HORD=YYYY 

II EXEC SAS 

//COMPSTAT DD DSN=USER.A000.COMSTAT2.INDUS90.DISP=SHR 
DATA NAME, 

SET COMPSTAT INDUSTRY; 

Research File 

/OSJE 

//jmrXCOMP JOB (YXYY.:15,2).’Vr)t/R— N/tME',n.ASS=A,PASSWORD=mT 
I/ EXEC SAS 

//COMPSTAT DD DSN=USER.A000.COMSTAT2.RF.S90.DISP=SHR 
DATA NAME-, 

SETCOMPSTAT.RESEARCH; 


OTC File 

/OSJE 

RFTIJRN 

IIXXXXC OMP JOB (.XXXX,-A5,2),'YOUR —AMAfU’,CLASS=B,PASSWORD=YYIT 
//COMPSTAT DD DSN=USER.AC44.COMSTAT2.OTC90,DISP=SHR 

DATA NAME, 

SET COMPSTAT.OTC; 


Bank File 

/OSJE 

RFTIJRN 

IIXXXXC OMP JOB (XXXX,:\5.2).'YOUR —M4Af/J',CLASS=B.PASSWORD=YYYy 
// EX EC SAS 

//COMPSTAT DD DSN=USER.AC.t4,COMSTAT2.BANK90.DISP=SHR 
DATA NAME, 

SET COMPSTAT.BANK; 


PDE File 

/OSJE 

RFTIJRN 

IIXXXX COMP JOB (XXXX,-.\5,2),'YOUR —N/tAfE’.CLASS=B,PASSWORD=lTyy 
II EXEC SAS 

//COMPSTAT DD DSN=USER.AC44.COMSTAT2.PDE90.DISP=SHR 
DATA NAME-, 

SET COMPSTAT PDE; 


The cut off date for these tapes is 
07/31/90. The data arc available from 
1970 to 1990. The tapes from the pre¬ 
vious years arc also available. Please 
contact the Computing Center (565- 
2324,1SB 119) for more information. 

Examples 

The SAS program set-up on the follow¬ 
ing page is for retrieving COMPUS¬ 
TAT data. In this example, it is assumed 
that you arc interested in getting finan¬ 
cial information about computer com¬ 
panies. Thus, you would look at the 
industrial code for these companies in 
Appendix C in the COMPUSTAT 
manual. You would find that the in¬ 
dustrial codes for these companies span 
5 numbers: 3570, 3571, 3572, 3575, 
and 3577. You also have decided that 
you want the information only in the 
year 1988. The financial information 
that you want consists of the following: 
Inventories (DATA003), Assets 
(DATA004), Liabilities (DATA005), 
Asscts/Liabilitics and Stockholders' 
Equity (DATA006), Sales (DATA007), 
and Fortune Rank (DATA279). The 
SAS program setup submitted from the 
MUSIC/SP operating system is shown 
on the following page. 


The BCIS Department 
is Really HOT!! 

A ccording to the 1991 Spring 
edition of Sourcebook 
Magazine, the UNT BCIS Depart¬ 
ment has one of the five hottest 
undergraduate IS programs in 
America. Other schools sharing 
the honor are California Polytech¬ 
nic State University at San Luis 
Obispo, Georgia State University 
at Atlanta, University of South 
Florida at Tampa and Bentley Col¬ 
lege, Waltham, Massachusetts. ■ 
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Sample Program Set-up for Retrieving COMPUSTAT Data 


/OSJE 

RETURN 

l/XXXXCO MP JOB (XXXX,: 1 5,2),'YOUR —N4A/fi , ‘,CLASS=A.PASSWORD=>m' 
// EXEc SAS 


//COMPSTAT DD DSN=USER.A000.COMSTAT2.INDUS90 DISP=SHR 
/DATAOUT DD DSN=USER.XXXX.NAMF.DISP=(NEW.KEEP).UNIT=SYSDA. 
II VOL-SER=ACADnn.SPACE=(TRK,(10.10),RLSE) 

TITt.E ’SAMPLE SAS PROGRAM TO READ COMPUSTAT DATA’; 


* Step One: Read COMPUSTAT DATA 

• 

DATA COMPUSTA; 

SET COMSTAT.INDUSTRY; 

KEEP C’NUM CONAME YEAR DATA003 DATA004 DATA005 DATA006 
DATA012 DATA 279; /* Select data items */ 

IF YEAR = 88; f* Select data for year 1988 */ 


* Step Two: Read Companies’ CUSIP number 


DATA CUSIP; 

INPUT <® 30 CNUM SCHAR8.; 
CARDS; 

/INC COMPANY.CUSIP 


* Sort Both Data Sets Before Merging 

PROC SORT DATA=COMPUSTA; 

BY CNUM; 

PROC SORT DATA=CUSIP; 

BY CNUM; 

• 

* Final Step: Create a SAS DATA SET for the company required. 

* The data set will be saved to disk for future use. 

* 

DATA DATAOUT.NAME; 

MERGE COMPUSTA CUSIP (IN=A); 

IF A; /* Keep only those companies that are needed */ 

RUN; 


For the more information about the COMPUSTAT II data, please sec the COMPUS¬ 
TAT handout, “Introduction to COMPUSTAT II,” which is available in the Comput¬ 
ing Center reception area, ISB 119. ■ 


Using IMSL for 
Research 

By George Morrow, Academic Computing 
Consultant (BITNET: AS01@UNTMUSIC) 

T he IMSL Libraries now consist of 
three libraries containing com¬ 
puter routines for analysis in Math, 
Statistics, and Special Functions 
(SFUN). These routines are called from 
a program written in VS FORTRAN. 

O The Math library contains routines 
for analysis in Linear Systems, 
Eigcnsystem Analysis, Interpola¬ 
tion and Approximation, Integration 
and Differentiation, Differential 
Equations, Transforms, Nonlinear 
Equations, Optimization, Basic 
Matrix/Vector Operations, and 
Utilities. 

© The Stat library contains routines 
for Basic Statistics, Regression. 
Correlation, Analysis of Variance, 
Categorical and Discrete Data 
Analysis, Nonparametric Statistics, 
Tests of Goodness of Fit and Ran¬ 
domness, Time Series Analysis and 
Forecasting, Covariance Structures 
and Factor Analysis, and Dis¬ 
criminant Analysis, Cluster 
Analysis, Sampling, Survival 
Analysis, Life Testing and 
Reliability, Multidimensional Scal¬ 
ing, Density and Hazard Estimation, 
Line Printer Graphics, Probability 
Distribution Function and Inverses, 
Random Number Generation, and 
Utilities. 

© The SFUN Library contains 
routines on Elementary Functions, 
Trigonometric and Hyperbolic 
Functions, Exponential Integrals 
and Related Functions, Gamma 
Functions, Error and Related Func¬ 
tions, Bessel Functions, Kelvin 
Functions, Bessel Functions of 
Fractional Order, Elliptic Integrals, 
Wcierstrass Elliptic and Related 
Functions, and Miscellaneous Func¬ 
tions. 
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The following is an example of a FORTRAN run in which an IMSL routine is called 
(ZBRENT in this example). 

//AXmMSL JOB OmXUVYour—Namc\PASSWORD=ymV 
II EXEC FORVCLG 
//FORT.SYSIN DD * 

INTEGER MAXFN. NSIG, IER 
DOUBLE PRECISION F, EPS. A. B 
EXTERNAL F 
A=-10.<) 

B = 0.0 
EPS = 0.0 
NSIG = 3 
MAXFN = 100 

CALL7.BRENT(F,EPS.NSIG.A,B.MAXFN.IER) 

WRITE (6.99909) B. MAXFN 

99999 FORMAT (' THE BEST APPROXIMATION TO THE ZERO OF F IS EQUAL 
TO', 

& F5.1. .’THE NUMBER OF FUNCTION EVALUATIONS'. NS’. 

& ' REQUIRED WAS’.12.'.V/) 

END 

REAL FUNCTION F (X) 

REAL X 

F = X**2 + X - 2.0 

RETURN 

END 

If you have any problems using the IMSL program libraries, contact Academic 
Computing Services, 1SB 119 (565-2324). ■ 


The Digital Dissertation: A Better 
Answer for the Future 

By Greg McArthur. Office of Vice Chancellor for Academic Affairs, University of Colorado 
at Boulder 

This article was reprinted from Buffer, the News journal of Computing at the 
University of Denver, March 1991.lt originally appeared in the May I June 1990 
issue of the University of Colorado's Digit, vol. 25, no.3. 

T hirteen years ago I received my doctorate in Environmental Biology. My 
dissertation dealt with disturbed habitat rcvegcialion and made use of com¬ 
puter-assisted methods to keep track of all the various data that impinge upon things 
environmental. 

Unfortunately, that was about the only computer assistance I had to help me produce 
and then write my dissertation — wordproccssors (indeed, wordproccssing) was not 
yet widely available. This was 1977. 


How to Find Out 
if Your Data are 
Heteroscedastic 

By Phanit Laosirirat, Academic Computing 
Consultant — Political Science Doctoral 
Student (BITNET: PHANrr@UNTVMl) 


M any situations arise during the 
process of data analysis that 
are less than desirable to the researcher. 
Heteroscedasticily is a condition en¬ 
countered when performing regression 
analysis of data. It occurs when the 
variance of the residuals or the error 
terms is not constant. That means it 
changes from one setting of the inde¬ 
pendent variables to another. 

In a general multiple regression model, 
each observation can be represented as: 

Y = Po+pi +P2H2 +.+ P<Ek + e 


Where: 

Y 

X,...Xk 

Po 

Pi-* 


il ihc dependent variable, 
arc ihc independent variables, 
is the intercept, 
arc the regression coefficient 
of the independent variables 
xi...Xk respectively, 
is the random error or 
residual. 


In order to obtain an unbiased estimate 
of the beta(s) (P), the residuals must 
satisfy the following assumption* 
(other non residual-related assump¬ 
tions arc also needed to be met): 

O Must have a normal distribution 
with mean equal to zero and 
variance equal to a 2 for any given 
set of values of independent vari¬ 
ables; 

0 The residuals are independent or 
have no evidence of serial correla¬ 
tion. 


Continued on page 13. 
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1990 CRSP Data Information 

By Panu Sitiiwong, Academic-Computing Consultant — “ABD”in Political Science fBITNET' 
PAN'trtfUNIVMl) 

T he following information applies to CRSP tapes with data ending DEC. 1989. 

Since the Daily Return and OTC data are used frequently, they are now stored 
as cataloged data sets on one of the HDS-8083 academic mainframe system disks. 
The sample program and all sub-routines arc also stored on a system disk. In addition, 
there arc prc-compiled modules for all the sub-routines stored on disk. The prc-com- 
piled modules will help speed-up your program if you haven’t made any modifica¬ 
tions to the sub-routines. 

CRSP Monthly Index and Return File: Data are Stored on Tape 

TMS vol. 105410 

RECFM LRECL Bl.KSIZF. 

1 CRSP.MRET.COPYR1T FB 80 80 

2 CRSP.MRET.BPROGAM FB 80 4000 

3 CRSP.MRET.CPROOAM FB 80 4000 

4 CRSP.MRET.INCLUDE FB 80 4000 

5 CRSP.MRET. ACCESS FB 80 4000 

6 CRSP.MRET. UTILITY FB 80 4000 

7 CRSP.MRET.INC.PDS FB 80 6160 

8 CRSP.MRET. ACC.PDS FB 80 6160 

9 CRSP.MRET. ITILPDS FB 80 6160 

10 CRSP.MRF.T.OBJLIB U 0 32760 

11 CRSP.MRET.CA1.ENDA VB 48 32760 

12 CRSP.MRET.DATA90 VB 26516 32760 

CRSP Daily Return File: Data are Stored on Disk 

Contents QSii RECFM LRECL BLKSTZF. 

Sample Program USER.CRSP.SAMPROG FB 80 4000 

Includes Sub- USER.CRSP.INCLUDE.PDS FB 80 4000 

Routines 


Utility Sub Routines 
Sub Routine Object 
Library 
Calendar File 
Dau File 


DSii 

RECFM 

LRECL 

BLKSIZE 

USER.CRSP.SAMPROG 

FB 

80 

4000 

USER.CRSP.INCLUDE.PDS 

FB 

80 

4000 

USER.CRSP.ACCESS 

FB 

80 

4000 

USER.CRSP.UTILITY 

FB 

80 

4000 

USF.R.CRSP.OBJLIB 

U 

0 

32760 

USER.CRSP.DRET.CALENDAVB 

48 

32760 

USER.CRSP.DRET.DATA90 

VB 

27680 

32760 


Continued on page 12 


Standalone wordproccssors were in¬ 
credibly expensive and using them re¬ 
quired a good deal of training. 
Graduate students, then as now, had 
very little in the way of money to buy 
cycles on a machine just to type their 
dissertation. In fact, most of the com¬ 
puter cognoscente sneered at the idea 
of using a computer’s power to emulate 
the function of a lowly typewriter. 

How things have changed. 
Wordproccssing is now a full blown 
and ubiquitous technology and it has 
made the production of dissertations a 
much less onerous task. 

I realize, of course, that not everyone 
has access to personal computers or 
wordprocessing software. But most of 
us do and I suspect that nearly all 
graduate students have found a way to 
get their hands on both. 

Now I note another kind of technology 
being developed that is as radical a 
change to 1990 dissertation writers as 
the use of a wordprocessor would have 
been to me in the late 1970s. It consists 
of a mixture of hypermedia elements 
written on CD-ROMs or high-density 
floppies. 

The current representatives of the 
genre are but the earliest forms of 
“knowledge navigators” used primari¬ 
ly in the tutorials and instructional 
materials we have all seen whenever 
we unpack a new Macintosh or open up 
the latest version of PageMaker. 

This nascent technology will definitely 
affect the way doctoral candidates as¬ 
semble and defend their dissertations. I 
do not know what to call this technol¬ 
ogy: perhaps it does not yet have a 
name. 

But for now I will refer to it as infor¬ 
mation processing as distinct from 
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NASDAQ File : Data are Stored on Disk 


Conlenls 
Sample Program 
Includes Sub- 
Routine 

Access Sub Routines 
Utility Sub-Routines 
Sub-Routine Object 
Library 
Calendar File 
Data File (Part 1) 
Data File (Part 2) 


RECfM 

FB 

FB 


DSN 

USER.CRSP.SAMPROG 

USER.CRSP.INCLUDE.PDS 

USER.CRSP. ACCESS 
USER.CRSP.UTILITY 
USER.CRSP.OBJLIB 


USER.CRSP.NASQ.CALENDAVB 
USER.CRSP.NASQ.DATA90 VB 
USER.CRSP.NASQ.DATA90A VB 


LRECL 

80 

80 


BLKSIZE 

4000 

4000 


You can use the TMSINFO program to find out more information about data sets 
stored on tape. Please note also that CRSP has changed the format of all tapes. 
Hence, you may need to get the new sample program sub-rouune ^ 
in order to process the new data. They can be retrieved using the IhBuhlNhK 

program. 

Sample TMSINFO Program 


/INC OSJE 
RETURN 

//idnnTMS JOB (idn»i.:10, 1 )'Your Name',? ASSWORD^/rmpH" 
//EXEC TMSINFO 
//SYSIN DD * 

VOL=105410 

I* 


Sample IEBGENER Program 


/INC OSJE 

RETURN _ 

//WnnENER JOB (u/n/i,:05,l,9999)TourName’,PASSWORD=MvipH- 

II EXEC PC.M=IEBC,ENER 

//SYSPRINT DD SYSOUT=A 

//SYSIN DD DUMMY 

//SYSUT2DD SYSOUT=B 

//SYSUT1 DD DSN=xxwrx.DISP=SHR 


Where: 


xxxxx is a data set name. 



Continued on page 13. 


wordprocessing. I believe it will lead to 
the creation of the digital dissertation. 
Such dissertations will be as different 
from the typewriter-produced versions 
as those produced on wordprocessors. 

Indeed, digital dissertations will be or¬ 
ders of magnitude more sophisticated 
and have a richer information content 
than anyone can now imagine. 

So, what is a digital dissertation? In its 
most basic form it is a multi-dimen¬ 
sional “document” linking discrete, 
yet highly associated pieces of infor¬ 
mation together such that the entire 
work is a definitive statement of newly- 
discovered knowledge. 

Standard, written dissertations are es¬ 
sentially the same, with the major ex¬ 
ception that they exist in linear form, 
line after line, page after page. They 
must be carefully constructed from 
beginning to end, to bring die reader to 
the conclusions arrived at in the 
candidate’s original research. 

Due to the limited nature of this highly 
linear process, large chunks of infor¬ 
mation that do not easily lend themsel¬ 
ves to this form of presentation must be 
omitted. For example, color illustra¬ 
tions, sound, animation, and video seg¬ 
ments that might have been part of the 
original research cannot be placed in 
current dissertations. Their omission is 
due to the limitations placed on the 
standard format required of all doctoral 
candidates — a written dissertation. 

But why not make use of the entire 
range of information resources at your 
disposal, rather than limiting yourself 
to the written word? The technology is 
here, now, to incorporate all of the 
other, non-written elements of your re¬ 
search into a dissertation. All it takes is 
a bit of imagination on the part of the 


Continued on page 13. 
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CRSP continued from page 1 ? 

Sample Program Set-up 

CRSP requires that you write a FORTRAN program to access the data. The sample 
program and the sub-routines are required in order to access the data. The program, 
however, is for general purpose use and you must add your own codes in order to 
obtain particular data items. The following sample shows a job stream forCRSPdata 
processing. If you have further questions about accessing CRSP data, please contact 
Academic Computing Services, ISB 119 (565-2324). 


/INC OSJE 
RETURN 

//M/mENER JOB (idnn.:05,}.<)999yYour\amr'.PASSWORD=mvspw 
// EXE FORVCLG 

//FORT.SYSLIB DO DSN=USER.AC09.CRSP89.INCLUDE.DISP=SHR 
//SYSIN 1)1) * 

yy****»*********»»*»***»»»***,,„*,»,*, t *,, t . 

//* FORTRAN Program 

//LKED.SYSLIB DD DSN=USER.CRSP.OBJLIB.DISP=SHR 
// DD DSN=SYSl.VFORTLIB,DISP=SHR 

//OO.SYSIN DD * 

//OO.FTIOFOOI DD DSN=xxxxxxxx,DISP=SHR 
//CiO.FTl IF001 DD DSN=xxxxxxx.DISP=SHR 


Where: FT10 and FT11 must be present in the program. FT10F001 

refers to the DD for the Calendar file. The DSN is either 
USER.CRSP.DRET.CALENDA for the Daily Returns data or 
USER.CRSP.NASDAQ.CALENDA for the NASDAQ file. 

FT11F001 refers to the DD for the Data file. The DSN is 
USER.CRSP.DRET.DATA90 for the Daily Returns data. The 
NASDAQ data are contained in two separate files. Hence the DD 
is as follows: 

//CiO.FTl 1F001 DD DSN=USER.CRSP.NASQ.DATA90,DISP=SHR 
// DD DSN=USER.CRSP.NASQ.DATA90A.DISP=SHR 


lleteroscedastlcltv cont inued from page 10. 

Hctcrosccdasticity happens when the first assumption is violated. The variance of 
the residuals varies across observations. For example, the variance of the residuals 
for observations 1 to 20 is different from variance of the residuals calculated from 
observations 15 to 40. If the residuals are randomly or normally distributed, their 
variance will be the same or constant for any given set of observations. 

There arc many possibilities that render the heteroscedastic contamination. Two are 
mentioned in this article. Forexamplc, the first possibility is when you include travel 
expenditures of a cross section of households in your model. Hctcrosccdasticity is 
a reasonable problem in this situation, since low income families will almost 


Continued on page 14. 


Dissertation continue d from nape 1 ? 

doctoral candidate, some software like 
HyperCard and wordprocessing 
programs, and access to a video 
camera, a tape recorder, and a Macin¬ 
tosh. 

Let’s consider a hypothetical doctoral 
candidate to show how a digital disser¬ 
tation would run rings around your 
standard dissertation to represent the 
same fundamental research. 

Suppose our candidate is interested in 
the reproductive biology of a species of 
killer Californian butterflies 
(Obstreperous californicus var 
smogensis) that are spreading into 
Southern Colorado. 

Our researcher has concluded, after 
four years of research, that the butterfly 

is entering Colorado along old Spanish 

trading routes, but that its expaasion 
throughout the entire state is being 
limited by a particular bird that hap¬ 
pens to enjoy eating this species of 
butterfly. 

Our researcher has discovered that not 
only is the butterfly’s expansion from 
California to Southern Colorado in¬ 
fluenced by activities (c.g., the Spanish 
trade routes), but a nice natural check 
has kept the population in balance and 
prevented a devastating overpopula¬ 
tion of these nasty butterflies in the 
southwestern portion of Colorado. 

Using a video camera, our intrepid can¬ 
didate has videotaped the breeding be¬ 
havior of the butterflies, made a 
detailed visual study of its morphology 
(particularly the female/male differen¬ 
ces), and has good footage showing a 
mass ascension of over two million 
individuals in their preferred habitat. 

The research also uncovered a cache of 
old Spanish mission documents 
describing the movement of the but - 


Continued on page 14. 
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certainly have low average expenditures on travel and also a low variance of actual 
travel expenditures. High income families, on the another hand, will spend more on 
travel which will reflect in both high mean of travel expenditures and greater 

variance. 

Another possibility of heteroscedasticity is when your observations are Pooled Cross 
Sectional Time Series or Pooled Time Series. Social scientists always encounter the 
problem of loo many variables and too small a number of observations. Many 
researchers have used pooled time scries in order to increase the number of obser¬ 
vations in their study. As its name implies, the pooled time scries is the combination 
of cross sections and time series. The variables used in the pooled time series arc 
observed from a number of different cross-sections over a time span. For example, 
cities observed over a 5-year period will look like: 


OBS 

GROUP 

CITY 

YEAR 

REVENUE 
PER CAPITA 

POPULAR INCOME 
CHANGE CHANGE 

1 

1 

ABILENE 

1976 

7.4106 

0.96121 

21.5010 

2 

1 

ABILENE 

1977 

7.3416 

0.96121 

21.5010 

3 

1 

ABILENE 

1978 

7.2240 

0.96121 

24.3554 

4 

1 

ABILENE 

1979 

7.7365 

0.96121 

24.3554 

5 

1 

ABILENE 

1980 

8.1783 

0.96121 

25.4624 

6 

2 

DALLAS 

1976 

23.4137 

6.5088 

19.7725 

7 

2 

DALLAS 

1977 

19.9819 

6.5088 

19.7725 

8 

2 

DALLAS 

1978 

18.7005 

6.5088 

25.7852 

9 

2 

DALLAS 

1979 

17.8606 

6.5088 

25.7852 

10 

2 

DALLAS 

1980 

16.7092 

6.5088 

19.9652 

11 

3 

FT.WORTH 

1976 

13.8924 

4.6900 

16.3844 

12 

3 

FT. WORTH 

1977 

13.4445 

4.6900 

16.3844 

13 

3 

FT.WORTH 

1978 

13.4426 

4.6900 

24.1286 

14 

3 

FT.WORTH 

1979 

12.9661 

4.6900 

24.1286 

15 

3 

FT.WORTH 

1980 

11.8687 

4.6900 

21.3059 

16 

4 

WICHITA 

1976 

9.5915 

-2.7823 

24.6832 

17 

4 

WICHITA 

1977 

9.4000 

-2.7823 

24.6832 

18 

4 

WICHITA 

1978 

9.0924 

-2.7823 

18.4469 

19 

4 

WICHITA 

1979 

9.4597 

-2.7823 

18.4469 

20 

4 

WICHITA 

1980 

9.3594 

-2.7823 

23.325. 


When these kind of data are used in an ordinary regression analysis, the model is 
likely to be contaminated by heteroscedasticity. Applying multiple regression to the 
above data, one is sure to get unequal variances of the residuals across the observa¬ 
tions. It is obvious for anyone io expect that the error or residual variance for Dallas 
is larger than the error variance for Abilene. Although, this article is not dealing with 
alternative data analysis models, it would be clear that ordinary regression analysis 
is not appropriate for this kind of data set. More complex models are more suitable 
for these types of data. Those complex models are Pooled Cross Sectional Time 
Series Regression and Seemingly Unrelated Regression. 


terflics along the trade routes, and even 
some ancient photos showing the but¬ 
terflies flitting around Pancho Villa’s 
head outside of Tuscon in the early 
1890’s. 

Finally, our researcher has recorded the 
sounds of the butterflys’ mating call, 
recorded anecdotal stories from 
ranchers along the butterflys’ migra¬ 
tion routes, and captured on tape the 
rapacious sounds of the principal bird 
predator dining on thousands of hap¬ 
less butterflies. 

Our doctoral candidate is now ready to 
commit the results of years of research 
to close scrutiny in the form of a disser¬ 
tation. But this time, rather than writing 
it all down, our candidate chooses to 
make use of the technology available 
and place the entire research undertak¬ 
ing into a digital dissertation (with sup¬ 
porting narrative, of course). 

Instead of presenting the usual “book” 
to the committee, they are astounded to 
receive a 3.5- inch diskette, with the 
simple instructions, “Please insert this 
diskette into any Macintosh with at 
least one megabyte of memory.” 

What the committee members see, 
hear, read, and experience as they 
wander through this dissertation we 
must leave to conjecture, for each digi¬ 
tal dissertation will be unique and put 
together in a distinctly individual way. 

At a minimum, we can guess that the 
committee members will “read" about 
the research project, click on various 
parts of the screen to see the video clips 
of the butterflies ascending into a clear, 
blue Arizona sky, look at highly 
detailed images of the anatomy and 
reproductive morphology of the but¬ 
terflies, hear their mating call, and read 
from excerpted portions of old Spanish 
mission documents about the 


Continued on page 15. 
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In the example above, the travel expenditures, it is easier to correct for hcteroscedas- 
ticity than the second example. Ordinary regression can still be used to analyze the 
data, but only after some of the variables arc transformed by using one of the 
arithmetic functions, for example log, sine, or square root. Depending on the 
characteristic of the distribution of the residuals, these transformations will stabilize 
the variance of the residuals. However, remedies for heterosccdasticity are beyond 
the scope of this article. The focus of this article is to detect hetcroscedasticity if it 
is present. 

Test Statistics 

There is a univariate lest of homogeneity of variance in SPSS under MANOVA. This 
test is called the Bartlett test. It is a univariate test because the only variable that we 
arc interested in is the residual. In SAS, there is a short routine that gives the Bartlett 
test. The only requirement of the Bartlett test in both statistical packages is the index 
variable. The index variable must be a scries of integers. In the sample data set above 
the index variable is GROUP. The index number is assigned such that members of 
the same cross section belong to the same group. The following steps arc necessary 
to obtain the Bartlett test for the residuals. 

O Apply OLS regression to your model. 

0 Extract the residuals from the regression model. 

© Apply MANOVA, if you arc using SPSS or apply the Bartlett routine in case of 
SAS to the residuals. 

Examples 

Two examples arc illustrated in this article, one using SAS and another using SPSS 
The samples were processed on SAS PC and SPSS PC+, however these two 
programs will work fine on both CMS and MVS on the mainframe. 

In the first example, on the following page, the residuals from the regression model 
arc saved in a new SAS data set named RCITIES. The name of the residual variable 
is RESID. PROC SUMMARY uses the RCITIES data set to calculate the variances 
of the residuals for each city and save them in the sas data set named BART. Two 
new variables are created in the BART data set; VARIANCE and NUM. 
VARIANCE is the name of the variable which contains the variance of the residuals 
for each city. NUM is the number of observations for each city. In the next step, SAS 
uses the BART data set to calculate the CHI-SQUARE for the BARTLETT test 
statistics. Output from this program is displayed in the next table. 

In this example, the CHI-SQUARE statistic is very significant This indicates that 
we can reject the null hypothesis that the variance in all cities are the same. It means 
that hcteroscedasticity is present in the model. A normal regression model is not 
suitable for this data scl A new model should be used for the data analysis. 

The second example shows how to use SPSS PC + to analyze the same data set 
Although, this program is processed on SPSS PC +, the same program can be used 
on the mainframe version. 


spread of the butterflies during the 
spring of 1876. 

As they continue to explore the rich 
amount of material supporting the dis¬ 
sertation, the committee will learn as 
much about the subject as the candidate 
— an in-depth analysis of this par¬ 
ticular research problem seen from a 
totally new perspective. Understanding 
of the subject is enhanced; new 
knowledge is generated and trans¬ 
mitted. 

Defending this type of dissertation 
would be no less difficult than defend¬ 
ing a traditional, typed version. The 
candidate would still be expected to 
show a mastery of the discipline, com¬ 
plete command of particulars en¬ 
countered in the research, and have the 
full support of all of the committee 
members to be granted a doctoral de¬ 
gree. 

But think how much more interesting 
and rich the entire experience would be 
for all concerned. The years of effort 
that went into obtaining all of the 
pieces or the research puzzle, the video 
segments, the original documents, etc., 
would all be part of the finished 
product 

The spurious example I chose here was 
from my own field. But the concept 
works equally well for any discipline, 
for any research undertaking. There is 
no limit to what a digital dissertation 
could cover. As with the research itself, 
it is limited only by the imagination and 
creativity of the author. 

I look forward to a graduate student 
creating the first such dissertation. ■ 


Benchmarks 


April 1991 


Page 15 



Thesis and Dissertation 
Production 



Example 1: Using SAS to test for Heteroscedasticity 


0001 

0002 

0003 

0004 

0005 

0006 

0007 

0008 

0009 

0010 

0011 

0012 

0013 

0014 

0015 

0016 

0017 

0018 

0019 

0020 

0021 

0022 

0023 

0024 

0025 

0026 

0027 

0028 

0029 

0030 

0031 

0032 

0033 


1JBNAME SASIN ’C:Y; 

Kl ^$1 INCHANGE PCSALES 1JCDKBT; 

OUTPUT OUT=RCITlHSR*RF.SID; 

RUN; ...... 


/• BARTLETT ROUTINK STARTS^HHRF... . 

PROT SUMMARY NWAY;/*COMPUTE & STORE THE VARIANCE & */ 
ClSgROUR ? NUMBER OF OBSERVATIONS FOR EACH LEVEL •/ 

OUTPUT OUT=B ART V AR=V ARIANCF. N=NUM; 

RUN; 

DATA _NULL_; 

SET BART END=EOF; 

S^Sf FREEDOM FOR THE CURRENT LEVEL •/ 

SLOGVAR+LOC.VARI'N; 

TOTN+N; 

NVAR=N* VARIANCE; 

SNVAR+NVAR; .. 

A+l; I* NUMBER OF LEVELS V 

SFRACT+l/N; 

IF HOF THEN DO; 

M=IOTN*LOG(SNVAR/TOTN)-SLOGVAR; 

C= 1 +(1 /(3* (A-1 )))* (SF'R ACT- 1/TOTN); 

CII1SQ=M/C; 

PROBCI II=PROBCl 1I(CI IISQ.(A-l)); 

Al.PI I A=1 -PROBCII1; 

PUT*''bARTLETT’S TOST: CHI-SQUARF.=* CHISQ 1 AI-PHA=' ALPHA V 
F.ND; 

RUN; 


Output From Example 1 


SAS 10:59 Friday, March 22, 1991 3 
I BARTLETT'S TEST: CH1-SQUARE=23.65245826S A1.PHA=0.0000937633. 


In Example 2, SPSS adds a new variable 
to the active file after REGRESSION. 
The new variable is RESIDUAL. 
RESIDUAL contains the residuals cal¬ 
culated in the REGRESSION step. 
MANOVA will use this new variable in 
the next step to calculate BARTLETT 
test statistics based on 5 cities. The 
output from this program is shown in 
the next table. 

The significant level indicates that there 
is no reason to accept the null 
hypothesis that the variances in all cities 
are equal. The conclusion is there is 
heteroscedastic contamination in the 
model. 

More information about hetcros- 
ccdaticity can be found in econometric 
textbooks. Some are mentioned in the 
references below. 
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Example 2: Using SPSS to test for Heteroscedasticity 


GET FILE='C:'CmES.SYS'. 

REGRESSION /VARIABLES ALL 

/MErilODENTER'TRUTHYR POC11ANGE INCHANGE PCSAI.ES LPCDEBT 
/SAVE RESID (RESIDUAL). 

MANOVA RESIDUAL BY GROUP (14) ^ 

/PRINT HOMOGENEITY (BARTLETT) 

/DESIGN. _ -___ 


Output From Example 2 


Univariate Homogeneity of Variance Tests 



Variable.. RESIDUAL 

Residual 

5.94439. P= -000 

Bartleu-Box F (4.3750) = 
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T he ECHO OFF and 
ECHO ON com¬ 
mands will not work on the 
TT00 dial un lines . Instead, 
you should use ECHO 
LOCAL and ECHO 
REMOTE. 
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ICPSR Summer Programs 

By Panu Sittiwong, Academic Computing Consultant (BUNBT: MNU@lwrvMl) 


Report from 
SUG116 


T he Inter-university Consortium for Political and Social Research (ICPSR) is 
spoasoring a series of quantitative methods training workshops and seminars 
during July 1, 1991 to August 23,1991. This summer program is divided into two 
sessions. The first session starts on July 1 and ends on July 26. The second session 
starts on July 29 and ends on August 23. Each session will have a series of lectures, 
and workshops. The first session schedule includes: 

Lectures: 

Quanatitative Methods of Program Evaluation. * The Logic of Data Analysis: 

„ Measurement and Design. 

Ba6ic Mathematics. Mathematics for Social Scicn- 

. , tists. 

Intro to Computing. 0 Advanced topics in Social Re¬ 

search. 


Workshops: 

Mathematical Models: Game Theory. 
Likelihood Models and Statistical Inference. 

Latino Research Issues. 

Intro, to Regression Analysis. 

Multivariate Statistical Methods. 

Special Workshops: 

Survey of Income and Program Participation 
(SIPP). 

Logit and Log-linear Models. 

Network Analysis. 

The second session schedules include: 

Lectures: 

Intro to Computing. 

Matrix Algebra. 

Dynamic and Longitudinal Analysis. 

Workshops: 

Regression Analysis 

Structural Equation (Causal) Models 

Mathematical Models: Rational choice. 

Categorical Data Analysis. 

Special Workshops: 

LISREL Models: Introduction. 

Utilization of Data Resources from the 1990 
Census 


Quantitative Historical Analysis. 
Quantitative Analysis of Crime 
. and Criminal Justice 
Intro, to Statistics and Data 
Analysis 1. 

Regression Analysis. 

Scaling and Dimensional 
Analysis. 


Management of Machine- 
Readable Social Science Infor- 
t mation. 

Regression Diagnostics. 


Formal Models of Social Sys¬ 
tems. 

Advanced topics in Social Re¬ 
search. 


* LISREL Models. 

Time Scries Analysis. 

Intro, to Statistics and Data 
Analysis II. 

Advanced Analysis of Variance. 


LISREL Models: Intermediate. 
Inlergenerational Relations: Re¬ 
search Issues, Methodology and 
Data. 


For more information contact either Dr. Valeric Martinez — Political Science — 
(817) 565-2276 or Panu Sittiwong — Computing Center — (817) 565-2324. ■ 


By Dr. Philip Baczewski, Academic 
Mainframe User Services Manager (BITNFT: 
AC12@UNTVM1) 

R ecently, Panu Sittiwong and I at¬ 
tended the sixteenth annual SAS 
User’s Group International (SUGI) 
conference, held February 17-20, in 
New Orleans. Louisiana. With over 
4000 people in attendance, there was a 
lot of opportunity to exchange informa¬ 
tion with people using SAS for a variety 
of applications. Attendees included 
university and corporate users of SAS 
as well as a very large contingent of 
staff members from SAS Institute. 

One highlight of the conference was the 
opening session in the New Orleans 
convention center. With almost all con¬ 
ference attendees present, all of one 
hall was used for a multi-media pro¬ 
gram emceed by conference chairman 
Martin J. Rosenberg of MAJARO Info- 
Systems, Inc. This included presenta¬ 
tions from several key managers from 
SAS Institute as well as from Dr. James 
Goodnight, creator of SAS and Presi¬ 
dent and founder of SAS Institute. 

SAS 6.07 

Much of Goodnight’s talk centered 
around the next version of SAS, version 
6.07, and its implementation of SAS’s 
Multi-Vendor Architecture, in which 
the greatest portion of the SAS system 
can run on a number of differing plat¬ 
forms with only a small portion needed 
to interface to that particular system. 
Also discussed was the availability of 
this new version in four stages over the 
next year. 

Version 6.07 is scheduled for first 
quarter 1991 release for the IBM 
RS/6000, the DECstalion 5000 series, 
the HP 9000/300 series, MIPS, Apollo, 
and Data General AViiON worksta¬ 
tions. The end of the second quarter 
should yield this release for Sun-3 and 
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Sun-4 operating systems, Silicon 
Graphics, HP 9000/800 scries, and the 
IBM AIX/370 operating system. Ver¬ 
sion 6.07 for IBM MVS and CMS, as 
well as VAX VMS is scheduled to ship 
by the end of third quarter, 1991, and 
the fourth family of version 6.07, due 
by end of fourth quarter, includes IBM 
VSE, OS/2, Windows 3.0, AIX PS/2, 
SCO/UNIX for 386 and 486 machines, 
and the Sequent and NeXT worksta¬ 
tions. 

One exciting component of version 
6.07 is the utilization of the native 
Graphical User Interfaces (GUI) of the 
Unix, OS/2, and Windows 3.0 operat¬ 
ing systems to enhance the uscabilily of 
the SAS system. Goodnight mentioned 
that many of the future developments 
in the SAS system will be geared to 
users of GUIs. Along these lines, he 
announced that version 6.04 will be the 
last version of SAS for the PC/MS- 
DOS platform, with the OS/2 and Win¬ 
dows versions updated as new releases 
are developed. He stated that version 
6.04 will be maintained for several 
more years, but recommended the 
move to a GUI. 

Goodnight went on to announce two 
new products entering the SAS line. 
SAS/CALC, described by Goodnight 
as “the ultimate spreadsheet,” is 
scheduled to ship with release 6.07 for 
MVS, CMS, and VMS. Also an¬ 
nounced was SAS/EIS, “an object- 
oriented, applications development in¬ 
terface to the SAS Applications Sys¬ 
tem.” This product was designed for 
development of specially tailored inter¬ 
faces to the SAS system, especially Ex¬ 
ecutive Information Systems. 

Goodnight’s speech included many ex¬ 
citing developments for SAS, and the 
end of the opening session of SUGI was 
quite exciting as well. Even though the 
conference was held a week after Mardi 
Gras, SAS arranged to stage a mini- 
Mardi Gras parade right there in the 
convention center. This included 
motorcycle policemen, several bands, 
floats, and of course, a lot of beads 


being thrown from those floats. SAS 
even provided Mardi Gras masks to 
each attendee to help get into the spirit 
of the evening. 

In the three days following the opening, 
there was the opportunity to attend a 
variety of paper sessions covering 
many aspects of the SAS system. 
There was also an extensive demo area 
which allowed hands-on experience 
with the version 6.07 on a number of 
different platforms. I even had the 
chance to try out SAS on the NeXT 
workstation. (My impression was that 
it was a bit slow on the 68030-bascd 
"cube," and will probably be much 
more responsive on a 68040 
NeXTStation or on an upgraded cube.) 
Also in the demo area were problem 
desks manned by SAS Institute staffers. 
I had the opportunity to talk at length 


they’re know in SUGI lingo), as well as 
in several papers. Several points quick¬ 
ly became evident Version 6.06 was a 
complete re-write of SAS for MVS and 
CMS, switching from PL/I to C as the 
development language. As a result, 
some aspects of this version yield 
poorer performance than the previous 
version 5.18, and in come cases cause 
compatibility problems with version 
5.18. Many of these problems are al¬ 
leviated in version 6,07, but for the 
mean time, the BOFs and paper ses¬ 
sions yielded some valuable tips for 
making SAS perform better on both 
MVS and CMS, some of which we can 
apply here at UNT to the test versions 
of SAS 6.06 on both CMS and MVS. 

Panu attended a BOF on the Window? 
3.0 version of SAS. He also had the 
opportunity to attend the SAS-L BIT- 


If you are a Windows 3.0 user, you can participate in 
evaluating the [SAS] product. Call Panu Sittiwong at 
565-2324 and ask to have your name put on the list. 


with Kicth Collins, the manager of 
product development for the CMS plat¬ 
form, and express to him our concerns 
here at UNT related to SAS on CMS. 

Another result of the demo area was 
that we were able to become a Beta- 
Test site for SAS 6.07 under Windows 
3.0., and expect to receive the software 
some time during the summer semester. 
If you are a Windows 3.0 user, you can 
participate in evaluating the product. 
Call Panu Sittiwong at 565-2324 and 
ask to have your name put on the list. 
You will need a 386SX or more power¬ 
ful PC with a minimum of 6M of 
memory in order to effectively run the 
SAS software under Windows. 

One topic of interest throughout the 
conference was the performance of 
SAS version 6.06 on the MVS and 
CMS platforms. This was discussed in 
several “Birds-of-a-Feather” spccial- 
interest-group sessions (or BOFs, as 


NET list BOF, and came away with a 
hot-pink official SAS-L SUGI 16 T- 
shirt. This was emblazoned with the 
unofficial “slogan” of SAS-L, RTFM, 
interpreted by one SAS staffer in a later 
session as “Read The Friendly 
Manual.” The SAS-L BOF was a 
chance to associate faces with the 
names of those posting messages to that 
mailing list. 

All-in-all, it was a memorable con¬ 
ference that yielded much valuable in¬ 
formation. SAS is aggressively market¬ 
ing its system for solutions in the cor¬ 
porate world, while at the same time 
continuing to provide its powerful 
statistical analysis capabilities. Of 
course, another reason this conference 
will linger in my memory may have 
something to do with its location: the 
Crescent City: ships steaming up and 
down the Mississippi; the St. Charles 
street car; the French Quarter, beignets 
and coffee at Cafe Du Monde. ■ 
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STATLIB: Public 
Domain Statistical 
Software 

This article was adapted from an ar¬ 
ticle by the same name in the February 
1991 issue of the University of Min¬ 
nesota Academic Computing Services 
Newsletter, acs. 

S TATI .IB is an electronic mail sys¬ 
tem to distribute public-domain 
statistical software and data. STATLIB 
is a minor modification of the NETLIB 
software [sec related article on this 
page and/or consult Benchmarks, 
February 1991 (page 9) and Novcm- 
bcr/Dcccmbcr 1990 (pages 16-19) for 
information on NETLIB). All of the 
STATLIB routines come in source code 
form and require the appropriate com¬ 
piler (usually FORTRAN). Keep in 
mind that (hough the routines are free 
of charge, they come with no guaran¬ 
tee. Sec Table 1 for a list of STATLIB 
libraries. 

Some types of problems can occur 
when using the routines. We recom¬ 
mend that you carefully test and check 
each routine. In particular, look for 
machine constants in the codes. 
Machine constants for a CRAY might 
produce incorrect results when used on 
an IBM-PC. 

Accessing STATLIB 

The index files and the software itself 
are available from STATLIB by send¬ 
ing requests in the form of electronic 
mail to STATLIB@LIB.STAT.CMU.EDU. 
The requests arc one-linc commands 
that can be the subject part of a mail 
message or can be the mail message 
itself. STATLIB interprets the com¬ 
mand, executes it, and sends you a mail 
message response. ■ 


apstat 

Select algorithms transcribed from the journal of Applied 
Statistics 

crab 

Kodiak Island king crab survey data. 

datasets 

Various data sets. 

directory 

Lists of addresses and E-mail addresses of statisticians 

Disease 

National Notifiable Diseases Data. 

general 

Software of general statistical interest. 

griffiths-hill 

Applied Statistics algorithms contained in the book of that 
name by Griffiths and Hill 

multi 

Multivariate Analysis and Clustering. An annotated directory 
and selected algorithms. 

S 

S functions, device drivers and related software. 

s-news 

Archives of the S-ncws mail, in digest format. 

xlispstat 

Luke Tierney’s XlispStat system for UNIX systems. (Luke 
Tierney is a professor in the University of Minnesota’s Statistics 
department.) 


Table 1: List of STATLIB Libraries 



Command 

Requests 

SEND INDEX 

Index listing of all available software. 

SEND INDEX FROM collection 

Index of routines within a specific 
collection; for example: 

SEND INDEX FROM S 

SEND routine FROM collection 

Routine contained in a particular 
collection; for example: 

SEND SPLINES FROM S 

Tkble 2: Some STATLIB Commands 


More NETLIB Public Domain Mathematical Software 

This article was adapted from an article by the same name in the February 1991 
issue of the University of Minnesota Academoc Computing Services Newsletter, acs. 

r J^hc following has been recently been added to NETLIB@ORNL.GOV: 

CRPC Software available fro the NSF Science and Technology Center 

for Research in Parallel Computation. 

The following have recently have added at NETLIB@RESERCH.ATT.COM. 

Special functions by D. AMos (=TOMS/644). 

Basic Linear Algebra Subroutines, level 3, for matrix*matrix 
operations. 

Another miscellaneous library for software written in C. 

Analysis and design of linear control systems. 

Spline fitting routines for various kinds of data and geometries. 
FORTRAN to C converter. ■ 


Amos 

BLAS3 

C 

cascade 

Diercks 

f2c 
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NSF Grants Bulletin 
Board Established 

From an article by Geraldine Son- 
nesso (ACSGFRKg'URVM) that ap¬ 
peared in the State University of New 
York at Buffalo Computer Center 
newsletter, Interface (Fcbruary/March 
1991). 

T he National Science Foundation 
(NSF) Grants electronic bulletin 
board was established by the NSF 
Division of Grants and Contracts. The 
purpose of the bulletin board is to pro¬ 
vide a forum for discussion and com¬ 
munication among grant ad¬ 
ministrators, NSF grant officials, and 
other interested parties on grant ad¬ 
ministration and policy matters. 

In addition, NSF posts official notices 
of interest to the grant administration 
community. 

Users who would 1 ike to be added to the 
electronic mailing list should address 

requests to grants-re<|ucst@>NSF.GOV. 
Notices may be posted to the bulletin 
board at grants<ffiNSF.GOV. 

Please include with your request, your 
name, title, institution, U.S. Mail ad- 


Baczewskl Compositions 
Performed 

T wo compositions by Dr. 

Philip Baczewski, Acad¬ 
emic Mainframe User Services 
Manager, were performed at the 
Society of Composers Inc. Region 
VI conference, March 7-9 at UT 
Arlington. 1=3=1, for computer- 
generated tape was presented on 
March 7, and Seven Preludes for 
Dinner, for clarinet and guitar, was 
performed on March 9.7 =3=1 was 
realized during the summer of 
1990 while Dr. Baczewski was at¬ 
tending a seminar on programming 
using NeXT workstations at Stan¬ 
ford University’s Center for Com¬ 
puter Research in Music and 
Acoustics. ■ 


THE BITNET CONNECTI 

By Dr. Philip Baczewski. B1TNFT 1NFORF.P (BITNET: ACWt.-NTVMl) 


This Column is a continuing feature of Benchmarks intended to present 
news and information on various aspects of the BITNET wide area 
network. 

Guidelines for Electronic Mail on BITNET (or Anywhere 
Else) 

B ack in the “old” days people were forced to perform their communication 
through writing, by hand, on blank sheets or paper and then conveying those 
blank sheets to others. It is even reported that the U.S. Postal Service, today known 
primarily for its ability to deliver multitudes of “junk mail” and bills to your door, 
conveyed large numbers of these hand-written communiques, known as “letters.” 
The exchange of letters tended to foster a certain style of communication: letters 
were received and thoughtfully read; letters were generally more formal than spoken 
communication; letters had permanence and could be saved for later reference; in 
responding to letters, people would often think and then write one sentence, think 
and write another sentence, etc. Now days, electronic mail has revolutionized 
communication. There is no need for paper or messy hand writing utensils. You no 
longer have to use that formal writing style. Mail is received and read with heretofore 
unknown speed. It’s now possible to dash off a reply to an electronic mail message 
without even thinking. 

Well, maybe it’s not quite that cut and dried, but electronic mail docs seem to have 
the ability to evoke what arc known in e-mail circles as “flames:” emotional 
responses to messages which don’t necessarily do much for fostering effective 
communication. Norman Z. Shapiro and Robert H. Anderson, in a report prepared 
for the National Science Foundation and published by the Rand Corporation, list 
several possible causes for the flame phenomenon: 

* Difficulty in determining the formality of a message from its appearance. 

* Attempts at humor, irony, sarcasm, and wit arc often misinterpreted. 

• Cues such as body language [or voice inflection] are lacking in electronic mail. 

° The ease of an immediate “reply" encourages “off the top of the head” respon- 



° Electronic messages containing hasty or ill-chosen words can stay in electronic 
in boxes or can be printed in a way that gives them importance never intended. 

° Although anonymity is often mentioned as a factor, we have observed no sig¬ 
nificant difference in “flaming" between remote correspondents who don’t 
know each other personally, compared with communication among people who 
know each other. 

Shapiro and Anderson go on to give several suggestions for minimizing the possible 
problems of “escalating emotions.” 
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Carefully label message that 
have a deliberate emotional 
content. Sometimes just the 
annotation “Flame! Flame!” 
alerts the reader to the fact that 
the writer knows he or she is 
being emotional. 

Resist the temptation to fire off 
a response. Write the response, 
file it away, and wait 24 hours. 
Reconsider the response later, 
in the light of a new day (and 
perhaps a rereading and 
reinterpretation of the original 
message). 

Use alternative media to break 
the cycle of message-and- 
response. A telephone call or 
personal conversation can do 
wonders, when we can use 
body language, eye contact, 
and the other cues we’ve 
developed. 

Just as in other human situations, the 
development of an etiquette can help 
solve some of the problems which 
potpntia\ly arjsc with electronic com¬ 
munication. John Quarterman, in his 
book entitled The Matrix, offers a num¬ 
ber of suggestions concerning e-mail 
etiquette when sending messages to 
others or posting messages to BITNET 
LISTSERV or USENET mailing lists: 1 

Electronic mail is not like other 
media. Treating e-mail just 
like the telephone, paper 
mail, or any other medium 
can lead to misunderstand¬ 
ings and mistakes. 

Emulate experienced users. See 
how those already posting to 
mailing lists make the most 
effective use of those forums. 

Be brief. Often a few well- 
chosen words are better than 
long-winded elaborations. 

Label your message. Choose a 
title that fits the subject and 
stick to it. 

1 Quarterman. John S., The Matrix (Digital 
Press. 1990) 34. 


LIST of the Month 


Each month we will highlight one of the BITNET LISTSERV 
Special Interest Group (SIG) mailing lists.This month's list... 

HUMANIST@UTORONTO 

Coordinator Willard McCarty (MCCARTY@UTOREPAS) 
HUMANIST is a LISTSERV discussion group for people who 
support computing in the humanities. Those who teach, review 
software, answer questions, give advice, program, write 
documentation, or otherwise support research and teaching in 
this area are included. The list is primarily intended for interac¬ 
tion rather than publication or advertisement. 

I BITNET is not just for those "techie" types, you know, and it’s 
not just limited to the sciences. It’s equally useful for exchange 
of iiformation for a variety of areas. This list provides a good 
example in its forum for those using computers in support of a 
"non-technical" field of study, the humanities. If you are inter¬ 
ested in subscribing to this mailing list, send the following 
command, either via an interactive message or as the first line 
of a mail message, to USTSERV@UTORONTO: 

SUBSCRIBE HUMANIST <yourname> 

where <your name> is your first and last name. 


Remember your audience. Use language, references, and subjects that will 
be comprehensible and not objectionable. 

Choose an appropriate medium andforum. Use a conference or mailing list 
on a topic related to that of your message. 

Identify yourself. Sign your message with some appropriate information 
such as your name and affiliation. 

Post new ideas. Try not to repeat what has already been said except in brief 
confirmation. 

Respond to the topic and not the person. Try to understand what the person 
is saying. If you can’t understand what the person is saying, ask. If you 
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must criticize someone, give them a chance to respond. If you comment 
on the style of a message, respond to the content as well. 

Read other messages before responding. Others may have already made 
the same obvious response. 

Don't respond in anger. Wait a few minutes or hours, or even until the next 
day. If you are still angry when you respond, say so. 

Give the benefit of the doubt. Misiakes, misunderstandings, and ignorance 
are far more common than maliciousness. 

Be careful with humor and sarcasm. Many people have trouble recognizing 
these things even in person. Some networks have developed typo¬ 
graphic conventions to get around the difficulties of expressing sub¬ 
tleties of expression through ASCII characters. One of the more univer¬ 
sal is that UPPER CASE means shouting. Another is the use of the 
sideways “smiley face,” :) or:-), to indicate lack of serious intent 
Do be encouraging and polite. The most effective encouragement is often a 
simple response acknowledging a posting. 

Discourage when necessary. But do it privately and politely when possible. 

Don’t discourage at all unless you’re sure it’s needed and that you are an 
appropriate one to do it. 

Assume Permanence and ubiquity. Mail posted to discussion lists and some¬ 
times even mail to individuals may be saved permanently, with or 
without your knowledge, and may be read by anyone, at any time, 
anywhere. Remember that even if a mail message has been delete, it 
may exist somewhere on a backup tape. 

It’s not enough just to observe etiquette. Quartcrman also provides some valuable 
guidelines for e-mail ethics: 

Observe copyrights. 

Cite sources. 

Be careful with private correspondence. Do not redistribute private cor¬ 
respondence without permission. Don’t read other people’s mail without 
permission. If you receive a message by accident, return it to the sender 
or forward it to the intended recipient. 

Be honest. Don’t distribute false information, and don’t pretend to be some¬ 
one you aren’t in order to take unfair advantage of someone else. 

Someone is paying the bills. Remember that what you post may cost others 
time and money. Try to stick to useful information distributed to ap¬ 
propriate people. 

Don't post harmful instructions or information. 

Resource sharing systems are not like anything else. A computer network 
is neither like a home computer system nor like any other single com¬ 
puter system. The damage that can be caused by mistakes or 
malevolence increases with the power and extent of the system. 

People depend on networks and conferencing systems. 

Don't leave a security hole unfixed. This applies to system administrators, 
system vendors, and users who choose obvious passwords. 

Don’t use security holes to cause damage. 

Some of these points of etiquette or ethics are obvious; others perhaps wouldn’t 
occur to you. By following these guidelines we can make electronic mail on 
BITNET or any other network a very effective and efficient means of communica¬ 
tion. Or we can wax nostalgic for the good “old” days of paper, fountain pens, 
envelopes, stamps, waiting five days to get a letter.... ■ 



BENCHMARKS 

FORUM 


BENCHMARKS FORUM is intended to 
serve as a vehicle for answering questions 
that may be of general interest to the user 
community. If you have a question, please 
send electronic mail to the Benchmarks 
editor (BITNET: AS<M®UNTVM1) or write it 
down and drop it by the Computing Center. 
We will try to answer it in the next issue. 

Question: / am fairly new to comput¬ 
ing and have a lot of questions about 
various topics. Where is the best 
place for me to look for answers to 
my questions? 

Answer: One of the things computers 
do for us is give us access to all kinds of 
information. Since many people have 
similar questions about various topics, 
“Frequently Asked Questions” (FAQs) 
have become a hot item on the Internet 
The following is a posting taken from 
comp.archives (on ANU on the V AX). 

Frequently Asked Questions re-up¬ 
date TSFAQ17C. ARC 
Sun 3-Mar-91:1 have updated my 
Frequently Asked Questions col¬ 
lection almost immediately after 
the last update to be 
/pc/ts/tsfaql7c.arc. The collection 
includes among other things the 
prerecorded instructions for 
gnrho.iiwasa.fi and I noticed that 
they were somewhat out of date. 
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Benchmarks Reader/User feedback Is encouraged. Send all letters, suggestions,etc 
to (AS 04 © untv Mi), FAX 817-565-4060 or to the Benchmarks Editor at: 

Academic Computing Services 
University of N. Texas Computing Center 
P.O. Box 13495, Denton,Texas 76203 


TSFAQ17C.ARC - Frequently asked questions & ans 


Filename 

Comment 

Date 

Time 

PAQ.INP 

Document (a readme) 

03-03-91 

19:18:10 

PAQ.NWS 

Newi announcement about tsfaq 

03-03-91 

19:11:36 

FAQNKWS.TXT 

Q&As relating to UscNctnews 

03-03-91 

19:13:10 

FAQPAS.TXT 

Turbo Pascal questions <6 answers 

03-02-91 

19:05:16 

FAQPHOGS.TXT 

Questions answers about programs 

03-03-91 

19:15:56 

FAQUOTE.TXT 

Postings and email of interest 

01-25-91 

16:08:18 

P02ANS.TXT 

Q&As on uwasa.fi archives & more 

03-03-91 

18:57:30 

TSPBOO.WF 

List of PD programs from T.Salmi 

01-25-91 

16:05:54 

VAASA. INF 

Info: Finland, Vaasa, U of Vaasa 

02-02-90 

11:52:54 



The wares arc available by anonymous ftp from garbo.uwasa.fi, Vaasa. Finland, 
12K.214.t2.37. or by using our mail server (use the latter if. and only if you don't have 
anonymous ftp). If you are not familiar with anonymous ftp or mail servers, I am 
prepared to send prerecorded instructions on request. (If you don't get the instructions 
from me within a few days, it will mean that your email address cannot be reached by a 
simple email reply. Contact your system manager for devising a proper mail path for you, 
because unless you do, you wouldn't be able to utilize the mail server anyway). (North 
American users should first consider a near site because of the overseas load.) 

Prof. Timo Salmi Moderating at garbo.uwasa.fi anonymous ftp archives 128.214.12.37 
School of Business Studies. University of Vaasa. SF-65101, Finland Internet: 
ts@chyde.uwasa.fi Funet: gado::salmi Bitnet: salmi@finfun 


Besides these FAQ archives, FAQs arc published monthly on some Internet lists. 
Peruse the topics you are interested in on ANU and watch fora FAQ posting. If you 
have questions about ANU, the Internet, or anything else, contact Academic Com¬ 
puting Services (ISB 119, 565-2324) and someone will help you. ■ 



Information 
Rresources 
Council News 


Minutes provided by Sue Harrison, IRC Recording Secretary 

T he Information Resources Council met on Tuesday, January 15, 1991 and 
conducted items of business which are briefly summarized here: 

Chairman Vondran reported that the General Access Computer Lab Report was on 
the agenda of the Information Resources Steering Committee meeting on Jan. 3. 
After discussion on the use of Student fees to fund General Access Labs, a study 
committee was named to assist in developing an acceptable formula for the allocation 
of any academic microcomputer monies that would be collected by means of the 
general computer fee. Vondran reiterated that the General Access Computer Lab 
Report with its recommendations, prepared by Cengiz Capan’s Commiuce, was 
officially approved by the Information Resources Steering Committee with the one 
exception of method of funding, which this new committee will now address. 


Chairman Vondran reported that he and 
Coy Hoggard met with the Information 
Resources Steering Committee on Jan. 
3, at which time the Commiuce ap¬ 
proved the recommendation of the IRC 
and the specific recommendation of 
Hoggard’s sub-committce to upgrade 
the administrative mainframe com¬ 
puter. President Hurley also approved 
the recommendation and the subject 
will be brought before the Board of 
Regents at their February meeting. 
Chairman Vondran commended Coy 
Hoggard and his committee for their 
representation of the IRC and the Com¬ 
puting Center. 

The recommended usage policy for the 
new Academic Unix System which was 
prepared by Dave Molta’s subcommit¬ 
tee, was briefly discussed by the IRC. 
Cengiz Capan moved that the policy be 
approved; the motion was seconded by 

Don Grose, and unanimously passed. It 
was agreed that the usage policy should 
be publicized to the university com¬ 
munity through Benchmarks , through 
distribution of the IRC meeting 
minutes, as well as by direct contact 
with major users such as Computer 
Science and the College of Business. 

Dave Molta reported that his subcom¬ 
mittee on Microcomputer Data In¬ 
tegrity was not prepared to present 
recommended solutions at this lime. 
He stated that the committee will move 
forward on the issue and welcomes any 
input from the Council members. Cen¬ 
giz Capan suggested that Molta’s com¬ 
mittee consider a broader aspect of the 
subject and look at overall disaster 
recovery needs. He further suggested 
that the committee deal with the issue 
of providing support centers for net¬ 
works that would be consistent across 
campus. It was pointed out that plan¬ 
ning is necessary to guard against 
having computer networks distributed 
across campus without having enough 
appropriate technical support available 
for them. Molta stated that there are 
two functioning groups of network 
managers who are dealing with the tac¬ 
tical end of the policy; one is a Network 


Benchmarks 


April 1991 


Page 23 


General Information 


Managers Group, made up of users, 
that provides daily support; and another 
is a Micro Network Advisory Group 
made up of people from across campus. 
Both of these groups include ad¬ 
ministrative as well as academic 
pocple. 

Richard Harris announced that he will 
be requesting a new Computing Center 
position, which will possibly be a 
Director of Networking and Distribu¬ 
tive Computing Support. This position 
would combine microcomputer and 
data communications support within 
the Center. 

Chairman Vondran asked that the 
Strategic Planning Subcommittee meet 
to establish the strategy for planning for 
the next five years. Richard Harris 
agreed to continue chairing that sub¬ 
committee and to meet prior to the next 
IRC meeting on Feb. 19. 

Coy Hoggard reported that the Policy 
for Adding Terminals and Workstation 
Access to the Central Administrative 
Computer is still in use. Though this 
was only implemented as a temporary 
solution to the mainframe overload, it 
is a workable policy until the 
mainframe can be upgraded. The con¬ 
census was that there was no need to 
change the policy at this time. 

Chairman Vondran asked for discus¬ 
sion on expanding the IRC member¬ 
ship. It was agreed that the IRC Charter 
would be consulted for direction on 
how to expand the Council; and 
Vondran slated that he would ask the 
Vice Presidents for their suggestions on 
additional representation from the ad¬ 
ministration. Vondran also asked IRC 
members to think about this subject so 
that it can be discussed further at the 
February 19 meeting. 

Richard Harris reported that the 
University’s Initial Operating Plan has 
been approved by the DIR. ■ 



OS/MVS JCL Utilities: IEBCOPY 

By Cathy Hardy, Academic Database Consultant (BITNHT: AC5S@UNTVM1) 

This is the fifth in a series of articles dealing with JCL (job control 1 anguage). This 
series is aimed at the current JCL user who would like to have a better understanding 
of statement use, utilities, and coding options. If you are not currently a JCL user, 
but would like to begin learning about JCL, Academic Computing has a free handout 
available in ISB 119. Stop by and ask for "IBM Job Control Language," or contact 
an Academic Mainframe Users Support consultant for further information. 


I 


EBCOPY is an IBM data set utility used to copy or merge partitioned data sets. 


This utility can be used to; 
create a backup of a partitioned data set. 

copy data sets which were unloaded to a direct 
access device (load), 
rename members. 

compress data set6 in place. 


copy a partitioned data set to a 
sequential data set (unload), 
replace data set members. 

exclude members from a data set 
that is being copied, loaded, or 
unloaded, 
merge data sets. 


° recreate a data set that has used 16 extents or 
is out of directory space. 


IEBCOPY Example 

In this example, members are to be selected, excluded, unloaded, and copied. 
Processing will occur, as follows: 

O unload, excluding members 
0 unload, selecting members 
© load and copy to merge members 


//JOBCARD 

//STEP I EXEC PGM=IEBCOPY 
//SYSPRINT DD SYSOUT=A 

//PDSI DD DSN=USER.XXNK.CLASS.NAME.A,UNIT=SYSDA, 

// VOI.=SER=ACAD03,DISP=Ot.D 

//PDS2 DD DSN=USER.XXNN.CLASS.NAME.B,UNTT=SYSDA, 

// VOI.=SER=ACAD03.DISP=OlD 

//SF.Q1 DD DSN=USER.XXNN.CLASS.NAME.NEWA,UNIT=SYSDA, 

// VOI-=SER=ACAD03,DISP=(NBW,KEEP), 

// SPACE=(TRK.(S,1).RLSE) 

//SEQ2 DD DSN=USER.XXNN.CLASS.NAME.TAPB.UNIT=TAPE9. 

// VOI-=nnnnnn.DISP=(NEW.KEEP), 

// LABEL=(,SL) 

//UPDATE DD DSN=USER.XXNN.CI-ASS.NEW,UNIT=TAPE9, 

// VOL=nnnnnn,DlSP=OLD.LABEI.=(,SL) 

//MERGE DD DSN=USER XXNN.CLASS.MERGE.UNIT=SYSDA, 

// VOL=SER=ACAD03,DISP=Ol.D 

//SYSUn DD DSN=&&TEMP1,UNTT=SYSDA.DISP=(NEW.DELETE), 

II SPACE=(TRK.(5,1).RLSB) 

//SYSUT4 DD DSN=&&TEMP2,UNIT=SYSDA,DISP= (NEW,DELETE), 

II SPACE=(TRK.(5,t)jU-SE) 

//SYSIN DD • 

COPY OUTDD=SEQUNDD=PDSl 

EXCLUDE MEMBER=(D.Q 
COPY OUTDD=SEQ2JNDD=PDS2 
SEUiCT MEMBER=(A.K) 

COPY OUTDD=MERGU.lNDD=((NEWUP,R).PDSl ,PDS2) 
EXCLUDE MEMBER=A 

r 
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In the previous example: 

• PDSl DD defines a partitioned data set 
that contains six members (A, B, C. D, 
E, F) and resides on volume ACAD03. 

• PDS2 DD defines a partitioned data set 
that contains three members (A, K, L) 
and resides on volume ACAD03. 

• SEQ1 DD defines a new sequential data 
set on volume ACAD03. 

• SEQ2 DD defines a new sequential data 
set on tape. 

• UPDATE DD defines an old sequential 
data set that is the unloaded form of a 
partitioned data set that contains eight 
members (A. B, C. D. M, N, O. P) on 
tape. 

• MERGE DD defines a partitioned data 
set that contains six members (A. B. C, 
D. 0, R) and resides on volume 
ACAD03. 

• The first COPY statement indicates the 
start of the first unload operation. (The 
input data set is partitioned, the output 
data set is sequential.) 

• The first EXCLUDE statement 
specifies the members D and C are to be 
excluded from the unload operation 
specified by the preceding COPY state¬ 
ment. 


The excluded members are sear¬ 
ched for on every input data set and 
are always omitted. 

• The second COPY statement indicates 
the start of the second unload operation. 
(Again, the input data set is partitioned, 
the output data set is sequential.) 

• The SELECT statement specifics that 
member A and K are to be included in 
the unload operation specified by the 
preceding COPY statement. 

Selected members are searched for in a 
Low-to-High (a - z) collating sequence, 
regardless of the order in which they are 
specified; however, they arc copied in 
the same physical sequence in whidi 
they appear on the input partitioned data 
set 

Once a member of a data set has been 
found, no further search is made for that 
member. When all members have been 
found, the copy or load step is ter¬ 
minated even if all input dau sets have 
not been searched. For example: If A 
and B are specified, and A is found on 
the first of three input data sets, it is not 
searched for again. Then, if B is found 
on the second input dau set, the opera¬ 
tion is successfully terminated without 


the third data set being searched, even 
if an A and B exist on the third. If, how¬ 
ever, A is not found on the first input 
data set, then it (the first data set) will 
be searched for B before going to the 
second input data set to search for A. 

• The third COPY statement indicates the 
start of the copy and load operations. 
Ihe replace option is specified for the 
UPDATE dau set; therefore, members 
in this dau set replace identically 
named members on the output dau set. 
Ihe first INDD data set is an unloaded 
data set that is to be loaded. The second 
and third INDD data sets are partitioned 
data sets that are to be copied. (The 
input data sets are sequential and parti¬ 
tioned; the output data set is parti¬ 
tioned.) 

For additional information on IBM 
utilities, check your IBM OS/VS2 MVS 
Utilities manual. The example came 
from OS/VS2 MVS Utilities, GC26- 
3902-1, p. 6-44. 

Next issue: IEBGENER and 
IEBPTPCH. ■ 


MICROCOMPUTERS 


Some PC 
Viruses 
Described 

T his is the Final listing (the first 
appeared in the March 1991 
issue of Benchmarks) of PC viruses 
described by Fridrik Skulason, author 
of ihe shareware antiviral program F- 
PROT (frisk@rhi.hi.is). These descrip 7 
lions are as of January 1991 and were 
distributed with F-PROT 1.14. 
[There’s still some good material here 
for social scientific research — cd.] 


• Old Yankee — There is some con¬ 
fusion regarding the various 
“Yankee Doodle” viruses. They all 
originated in Bulgaria and play the 
tunc “Yankee Doodle”, but their 
structure is different. This is the first 
one. It only infects .EXE files, in¬ 
creasing their length by 1961 bytes. 
When an infected program is run, 
the virus will search for a non-in- 
fected file. When it has been in¬ 
fected, the virus plays the melody, 
before the original program is ex¬ 
ecuted. At the very end of the virus, 
the word “motherf-er” appears. 

Another version of the same virus is 
also known. The main difference is 
the length - the variant is only 1624 
bytes long. 


• Oropax — This virus probably 
originated in W. Germany. It is not 
very harmful - when it activates it 
will just repeatedly play three 
melodies. The virus infects .COM 
files, other than COM- 
MAND.COM. Infected files grow 
by 2756-2806 bytes, becoming a 
multiple of 51 bytes in length. This 
virus stays resident in memory, but 
it will not infect other programs 
when they are executed. Instead it 
will search for a file to infect when 
files are created or deleted, a sub¬ 
directory is created or the access 
mode of a file is changed. A few 
other functions may also trigger an 
infection. 

The virus uses a random number 
generator to decide when to become 
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active, and if it docs, it will start 
playing 5 minutes after an infected 
program is run. 

• Parity — This is a simple, 441 byte 
virus, which will infect one .COM 
file when an infected program is run. 
The virus may emulate a parity error, 
displaying PARITY CHECK 2 on 
the screen and halting the computer. 

• Perfume — A .COM infecting virus 
of German origin, that will some¬ 
times ask the user a question and not 
run the infected file unless the 
answer is “4711", which is the name 
of a perfume. This virus will look 
for COMMAND.COM and infect it 
unless it is already infected. In¬ 
fected files grow by 765 bytes. In 
the most common variant of the 
virus the questions have been over¬ 
written with garbage. 

• Phoenix — This is a family of 
viruses from Bulgaria, containing 
1226, Proud (1302), Evil (1701) and 
Phoenix (1704). They all infect 
.COM files, and use a complicated 
encryption method, which compli¬ 
cates detection somewhat. 

• Piter —This 529 byte Russian virus 
has not been analysed yet, but it in¬ 
fects only COM files, overwriting 
the beginning of the programs, and 
placing the original code at the end 
of the infected file. 

• Plastique (AntiCAD) — This is a 
family of viruses from Taiwan. 
They arc members of the Jerusalem 
family, perhaps based on the Fu 
Manchu variant, but arc long - one 
2900, one 3012 and three 4096 byte 
variants. One of the 4096 byte 
variants is also known as “Invader”. 
The 4096-byte variants also contain 
code for infecting the bool sector. 
The reason the viruses are also 
known as AntiCAD is that they are 
targeted against the AutoCAD pro¬ 
gram. When a program named 
ACAD.EXE is run or sometimes 
when Ctrl-Alt-Del is pressed, the 
viruses will activate, overwriting 
data on floppy disks and hard disks, 
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This column is intended to serve as a forum for sharing useful tips on making more 
productive use of microcomputers. If you have a tip that you feel may be of use to 
campus users, submit it to the Benchmarks editor for possible inclusion in a future 
issue. 


Software License and Waranty Agreements 


This Micro-Tip was reprinted from an article byDebbi Buffamanli (MICDEBQUBVMS,) 
that appeared in the State University of New York at Buffalo Newsletter, INTER¬ 
FACE (April! May 1990. page 19). 

S oftware, like books and musical recordings, is subject to certain copyright laws. 

In fact, the whole question of the ownership and distribution of intellectual 
property is an important one, and will probably become more crucial as times goes 
on. If developers can’t be compensated for the research and development lime 
they’ve invested in one particular software package, they will be a lot less likely to 
continue to develop new products tomorrow. In this article I’ll be discussing th® 
implications of copyright laws, and what you can expel if they are violated. First 
let’s sec exactly what the new user may Find inside their new software package. 

Imagine the joy of purchasing a new, very expensive and long-awaited piece of 
software; something you’ve always wanted, and Finally mail-ordered. Then, upon 
opening the package, you’re confronted with something resembling the following 
notice: 

IMPORTANT: READ CAREFULLY BEFORE OPENING — By opening this 
sealed disk package, you indicate your acceptance of the following agreement: 

The rest of this slightly foreboding document details the following sons of things 
you may read on this enclosure; contractual things like ... 

• You have the right to use the software on a single terminal; or one copy on a 
licensed computer network. 
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• The software is owned by the vendor of the product; and is protected hvirc 

Seat the S5t5^ like any otoer'S2 m aSSS 
(e g- A BOOK OR MUSICAL RECORDING) LUPYRIGHTCD MAT ERIAL 

• You may make one copy of the product for backup or archival purposes 

’ praduct 10 * !i,,8lc ha,d disk p " ,vided you k “p 

• You cannot rent or lease the software 

' =sse 

• You may not reverse engineer, decompile, or disassemble the software 

’ ES?! WarC P2ckage conlains both 5 -25" and 3.5" diskettes, you mav onlv use 
the diskettes appropriate for the drives in your own computer. * * 

Since this is a contractual agreement, the vendor also has certain obligations and 
responsibilities. These obligations include: 

’ £ forgodJJ. Wi " PC ' f0rm “ substanlial, y” in accordance with the documen- 

• Warranties, which are usually limited to 90 days from the dale of purchase 

• w * rn ? n ‘ i f whi <* may include the return of the purchase price or 

replacement of software at the vendor’s option. P 

• The above is not valid outside the United States. 

• Other warranties, which vary from state to state, may apply. 

• In most cases, vendors will not be held liable for damages WHATSOEVER (i e 

zziszf - profiE ° r infonMut " 

In short, you are directly responsible for the actual use of the software whether nr 
no that software really does what the vendor’s marketing SS^yslfSeT 
to you are only allowed to use it in the Patou specified 

U® 1 JjfPpens if you makc lols of unauthorized copies or decompile the software’ 

• software ! 11 * 1 aSSUme y0U ^ agrecing 10 a binding contract upon opening the 

Anr"? Cpt ^ usc a PP |ics 10 software; this can include the effect upon the 

potential market for, or the value of, the copyrighted work. 

Ifon and fiESLW ** 1 r Y ‘" C8al C ° Pying ’ and you "W ** subject to prosecu- 
Uon and fines which can (in some cases) exceed S50.000.00 and up to 5 years in 

• U.S copyright law is not clear whether the purpose of a back-up copy includes 
2 * 2 “ m ^ C ZV fthefU in COmc cases ' «n only E usS ifl 

SEE a ‘ a f " r ^ a 

In short, read those agreements! H 


ytrusts continmvt fyp fn na( , P . 9 f, 

as well as garbling the contents of 
the CMOS. 

• Polimer — The name of this vims 
is derived from a text message 
which infected programs may dis¬ 
play when executed: 

A le'jobb kazeiu a POLIMER kazeua ' 
Vegye ezi I 

The virus is a simple, 512 byte 
■COM infeetor, which seems to be 
based partially on the Vienna virus. 

• Pretoria (June 16th) — Pretoria is 
a direct-action, .COM infecting 
virus from South Africa. It over¬ 
writes the first 879 bytes of infected 
files with itself, and stores the 
original 879 bytes at the end of the 
file. This makes it impossible to 
fully restore programs shorter than 
879 bytes, as their original length is 
not stored anywhere. 

When an infected program is ex¬ 
ecuted, the virus searches the entire 
current directory for .COM files to 
infect. As it uses a full-depth recur¬ 
sive directory search, this may take 
considerable time on an XT-class 
machine. 

On June 16th the execution of an 
infected file will cause all entries in 
the root directory to be changed to 
’ZAPPED’. June I 6 U 1 is the day 
when the Soweto riots first broke 
out. 

The vims uses a simple substitution 
encryption. 

Prudents—This is a Direct-Action 
Spanish virus, 1205 bytes long 
which infects .EXE files. It is 
probably written by the same author 
as the PSQR variant of the 
Jerusalem virus, as it checks if that 
vims is present, when it is mn. This 
vims will overwrite the last 32 bytes 
of any file it infects, possibly 
destroying the victim. The vims ac¬ 
tivates on May 1st - May 4th of any 
year, interfering with disk writes. 

Saddam — This vims is based on 
the Stupid vims, and is also from 
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Israel. It is 919 byte long. The most 
obvious effect of the vims is to dis- 
play: 

HEY SADDAM LEAVE QUEIT 
BEFORE I COME 

. Shake — Thakc is a primitive 476 
byte .COM infecting vims. It may 
infect the same program over and 
over, and infected programs may 
cause a reboot when executed. The 
name is derived from a string which 
is found inside the vims: 

Shake well before use ! 

This message is occasionally dis¬ 
played when an infected program is 

run, followed by termination of the 
program. 

. slow — This is a variant of the 
Jerusalem virus from Australia. 
Some reports indicate it may cause 
a slowdown of the system, but this 
has not been confirmed. The length 
is 1716 bytes, with extra 5 bytes 
added at the end of .COM files. The 
major difference between this virus 
and the standard Jerusalem virus is 
the use of a simple encryption in the 
Slow virus. A variant of Slow has 
been reported in California. This 
2126 byte variant is known as 
“Scou’s Valley". 

• South African “Friday 13.” — 
This is one of the oldest viruses 
around, but it is very rare. In fact, it 
may even be extinct, apart from 
copies in the hands of virus re¬ 
searchers. It is a "direct action" 
virus, that will seek out one or more 
programs to infect, every time an 
infected program is mn. It will only 
infee;t .COM files, which grow by 
415.544 bytes, depending on the 
variant in question. The original 
virus is 419 bytes long. Like the 
Lehigh virus it can be stopped simp¬ 
ly by making .COM files read-only. 
Most other viruses are not bothered 
by this, however. A version of this 
virus, called “Virus-B”, where the 
destructive part has been disabled 
has been distributed for demonstra¬ 
tion purposes. 


Stupid (Do-Nothing) — The “Do- 
Nothing” or “Stupid” virus is not a 
well written one. In fact it is so 
badly written that in many cases it 
will simply cause the system to 
“hang”, instead of properly infect¬ 
ing it. It seems that this virus was 
created by a lousy programmer, 
somewhere in Israel. It is not a 
serious thrcaL It will only work on 
machines with at least 640K of 
memory, because it always tries to 
hide itself at the same address, start¬ 
ing at 9000:0000. This virus infects 
.COM files, which grow by 583 
bytes. The original infected pro¬ 
gram distributed by the author con¬ 
tained the siring: 

(c)Stupid 1989 Virushmock! 

A variant of this virus, also from 
Israel, is somewhat longer - 919 
bytes. It is known as the “Saddam” 
virus, because of the following text 
message: 

HEY SADDAM 

LEAVE QUEIT BEFORE I COME 
. Subliminal — As this wirus vas 
found close to where the Dyslexia 
was discovered and as the viruses 
are clearly related, they arc believed 
to have been written by the same 
person - “Subliminal” probably just 
being an earlier version of the same 
virus. The Subliminal virus derives 
its name from its activity - it will 
continously display the text 
“LOVE, REMEMBER?" for a frac¬ 
tion of a second in a comer of the 
screen. 

• Suomi—This is an encrypted 1008 
byte virus from Finland. It infects 
COMMAND.COM as soon as an 
infected program is mn. The virus 
has not been fully analyzed yeL 
. Superhack — This is a 1077 byte 
vims, which infects .COM files in 
an ordinary way, but .EXE files are 
infected in a similar way, which 
prevents the vims from infecting 
.EXE files longer than 64K. The 
vims was written in Scotland and is 
known there as “Murphy”. 


, svC — This is the first Russian 
“stealth” vims. It is 1689 bytes long 
and infects COM and EXE files. It 
has not been analysed yet, but it 
contains the text string: 

(c) 1990 by SVC.Vers. 4.0 
• Sverdlov — This is an encrypted, 
1962 byte Russian virus, which was 
first discovered in Sverdlov. It is 
reported to display a moving banner 
across the screen, while playing the 
Russian national anthem. After¬ 
wards drive C: is reported to be 
inaccessible and a low-level format 
is required to make it usable again. 

. Svir— Thisisan unremarkable 512 
byte direct-action .EXE file vims. 

Its effects (if any) are not yet 
known. 

. Sylvia—This virus is a bit unusual, 

to say the least. It contains the fol¬ 
lowing message: 

This program is infected 

by a HARMLESS 
Text-Virus V2.1 

Send a FUNNY postcard to : Sylvia 
Verkade, 

Duinzoom 36b, 

3235 CD Rock anje 
The Netherlands. 

You might gel an ANTIVIRUS 
program. 

It will display this message when an 
infected program is executed, but if 
the above text is tampered with, the 
following message, (which is stored 
in an encrypted form) will appear 
instead: 

F— YOU LAMER !!!! 
system halted...$ 

Some people have a weird sense of 
humor... 

As the text above indicates, the 
vims originated in the Netherlands. 
When an infected program is run, 
the virus will seek out up to 5 .COM 
files to infect. It will search drive 
C: and the current drive. The three 
system files, COMMAND.COM. 
IBMBIO.COM and IBMDOS. 
COM are not infected. The virus 
adds 1301 bytes to the beginning of 


Page 28 


April 1991 


Benchmarks 


MICROCOMPUTERS 


Ihc files it infects (and also 31 bytes 
to the end), but docs no other 
damage. The girl mentioned above 
exists, but she says that she has no 
idea who the author is. It is very 
likely that he knows her, though. 

• SysLock — The SysLock virus in¬ 
fects .EXE and .COM files. It is a 
“Direct Action” virus that will 
search for files to infect when an 
infected program is executed. The 
virus will first modify the length of 
any program it infects, so it be¬ 
comes a multiple of 16 bytes. Then 
the virus code, 3551 bytes is ap¬ 
pended to the file. 

One unusual “feature" of this virus 
is that it will search the disk for the 

string “Microsoft” and change it 
into “MACROSOFT”. The virus 
will not infect programs if the en¬ 
vironment contains SYS- 
LOCK=@. 

Four other variants of this virus are 
known. Two are called “Macho”, 
since they both will change the 
string “Microsoft” into 
MACHOS OFT”. One is very 
close to the original virus, the other 
a bit different. The third variant, 
“Advent” will activate in December 
and then play “Oh, Tannenbaum”. 
The last variant is 2232 bytes long 
and known as “Cookie”, because it 
may display the message 
I want a COOKIE! 

Taiwan 

This virus seems to have appeared 
in Jan ’90. It is a direct-action 
.COM infector, which activates on 
Oic 8th day of any month, overwrit¬ 
ing the FAT and root directory of 
drives C: and D: Two variants are 
known, one is 708 bytes, but the 
oiheroneis743. Infected programs 
sometimes “hang”, for some un¬ 
known reason. 

• Tenbyte — This is a 1554 byte long 
.COM and .EXE infecting virus that 
was by accident posted to the V- 
ALERT electronic mailing list, 
which is intended for urgent mes¬ 


sages regarding virus infections. 
Just like the “Stupid” virus, it will 
only work on machines with at least 
640K memory. It activates on Sep¬ 
tember 1st, and stays active until 
December 31 si While active, it 
will corrupt all disk writes, deleting 
the first ten characters of any data 
written to the disk and add ten “gar¬ 
bage” characters at the end. This 


will cause destruction of data files, 
as well as preventing the compila¬ 
tion of any programs. 

• Traceback — The “Traceback” 
virus produces a screen display 
similar to that produced by the Cas¬ 
cade virus. 

There are of course differences, 
since the viruses are totally unre¬ 
lated. Every file infected with 

‘Traceback” contains the name of 
the file that infected it. This makes 
it possible to trace the path of the 
infection. 

Another difference is that it is pos¬ 
sible to make the chapters “jump” 
back up, by pressing keys on the 
keyboard, after all the characters on 
the screen have fallen down. 

There are two variants known of 
this virus, but the size is the only 
significant difference. The original 
virus is 3066 bytes long, but the 
variants are 2930 and 3031 bytes 
long. 

TUQ — This is a simple, non- 
re markable, 453 byte long .COM 
infecting virus, probably written in 
Germany. It has no effects other 
than replication. 

Turbo — The name of the virus is 
derived from a text string contained 
inside iL The virus family contains 
two viruses, one 512 byte known as 
‘Turbo Kukac”, but the other as 
Turbo-448. Both viruses only in¬ 
fect COM files, and may cause pro¬ 
gram crashes. Forexample.COM- 
MAND.COM will always crash if 
infected. 

T^rku — This is one of two viruses 
from Finland, the other being the 


“Suomi” virus. It will infect .COM 
and .EXE files. The length of the 
vims is either 1232 or 1472 bytes 
depending on the file type. No inter¬ 
esting effects have been observed, 
other than occasional “repeats” of 
certain keys, when pressed. 

• Vacsina and Yankee Doodle — A 

programmer in Bulgaria has written 
a number of viruses - 50 different 
variants or so. Two of the variants 
number 5 and 39 “escaped” to the 
West in 1989. One of the features 
of virus in this family is that they 
contain a version number system, 
simi lar to that used in the “Den Zuk” 
vitus. If a virus in the family finds a 
file infected with an older version of 
itself, it will remove the infection 
and re-infcct with the new version. 


A number of the variants play the 
tune "Yankee Doodle", but the 


the original “Yankee Doodle" virus, 
which is called "Old Yankee” by the 
F-FCHK program. 


This family can be divided into two 
groups, one consisting of versions 
numbered below 38, but the other 
one versions 38 and upwards. The 
first group is identified as “Vacsina” 
variants by F-FCHK. and the 
second one as “Yankee Doodle” 
variants. However, this division is 
based on differences in the internal 
structure of the viruses - several of 
the “Vacsina” viruses also play 
“Yankee Doodle”. 


The “Vacsina” viruses seem to have 
been written originally to infect 
only .COM files. .EXE files are 
also infected, but that is done in two 
steps. First a short piece of code is 

added to the end of the file. Then a 
JMP command is added at the front 
of the file. This code seems to be 
based on the code used in FOR- 
MAT.COM and CHKDSK.COM in 
some versions of MS-DOS. When 
executed it will relocate the .EXE 
file. This makes the .EXE file struc¬ 
turally equivalent to a .COM file, so 
it can be infected as one. 
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The second group (versions 38 and 
upwards) infects .EXE files in a “or¬ 
dinary” way. 

Compared to most other viruses, 
these arc fairly harmless. In the first 
versions a beep (BELL) is heard, 
every lime a .COM-type file is suc¬ 
cessfully infected. As mentioned 
before, some of them play “Yankee 
Doodle”, sometimes at 5 o’clock, 
but other variants play the tunc 
when the computer is rebooted by 
pressing Ctrl-Alt-Del. 

The latest versions of the viruses 
contain several advanced features - 
including self-correcting Hamming 
code, disabling of debugging tools, 
and the ability to search for and 
remove the Ping-Pong and Cascade 
viruses. 

One related virus is also known, but 
it is shorter than any of the Yankee 
viruses, only 1049 bytes long. It 
seems to have been created by 
removing parts of the “Yankee” 
code. 

Vcomm — An .EXE infecting virus 
that came from Poland. It is not 
very well written, but easy to study 
because the commented source 
code was included in the sample 
that arrived from there. When an 
infected program is run, it will in¬ 
fect one .EXE file in the current 
directory. Infected programs arc 
first padded so their length becomes 
a multiple of 512 bytes. Then the 
virus adds 637 bytes to the end of 
the file. It will also install a resident 
part that will intercept any disk 
write and change it into a disk read. 
• VFSI (Happy) — This virus is 
believed to have been written by a 
student at the "Vish Finansovo- 
Stopanski Instiuit” in Bulgaria. It is 
a small, 437 byte, direct-action 
.COM-infecting virus. It can be 
stopped by making files read-only. 
When it activates, it may display the 
following messages. 

HELLO!!! HAPPY DAY and 
SUCCESS from virus 1.1 VFST-Svisiov 


• Victor — This is a 2442 byte .EXE 
and .COM virus from the USSR, at 
least according to the text found 
inside it: 

Victor VI .0 The Incredible High 
Performance Virus 

Enhanced versions available soon. This 
program was imported from USSR. 

Thanks lo Ivan 

Little is yet known about its effects. 

• Vienna — This virus, also called 
DOS-62. UNESCO and 648 will 
only infect .COM files. When an 
infected file is run, the virus will 
search for an uninfected file and 
infect it. One out of eight files in¬ 
fected is destroyed, by overwriting 
the first few bytes with instructions 
that will cause a restart when the 
program is run. 

Infected files can be easily found 
because they contain an “impos¬ 
sible” value (62) in the “seconds” 
field of the time stamp. 

Unfortunately the source code to 
this virus has been published in a 
book Computer viruses: A High- 
Tech Disease , so it will probably 
become very common in the future. 
This version was modified slightly, 
in order to make it a little less harm¬ 
ful - it would only infect files in the 
current directory. 

The virus appends 648 bytes to the 
files it infects, buta646 byte variant 
is also known. 

One variant of this virus, “Lisbon”, 
has been found in Portugal. It has 
clearly been modified and reas¬ 
sembled - probably in order to fool 
signature-type anti-virus programs. 
This virus overwrites the beginning 
of the programs it destroys with 
AIDS”. The GhostBalls virus is 
also closely related. Several other 
variants have been reported in Bul¬ 
garia. Some of them are so different 
that they have received a new name, 
“New Vienna”. The Bulgarian 
variants are similar to the original 
virus, but the changes include: Dif¬ 
ferent length - 435,367 354 and 348 


bytes. Different damage function - 
formatting of hard disk. Critical 
error handler added. 

Other variants exist as well. One, 
named ’Violator’ is 1055 bytes 
long, and it contains the following 
text strings: 

TransMogrificd (TM) 1990 by RABID 
N'tnl Development Corp. 

Copyrighi (C) 1990 RABID ! 

Activation Date: 08/15/90 - Violator 
Strain B (Field Demo Test Version) 

•NOT TO BE DISTRIBUTED* 

The text seems to indicate the exist¬ 
ence of another version, which has 
not yet been reported anywhere. 
Perhaps the most unusual variant is 
“Father Christmas” or “Choinka”, 
which was discovered in Poland. It 
is 1881 bytes long, with mast of the 

extra length devoted to a Christmas 
greeting. The Monxla or “Time’ 
variant is 939 bytes long, and has 
different effects, depending on the 
exact time when it activates. 

• Virdem — This 1336-byte, direct 
action .COM-infecting virus was 
written in 1986, which makes it one 
of the oldest viruses in existence. It 
was written by R. Burger, the author 
of Computer Viruses: A High-Tech 
Disease. It will not spread unless 
modified, because the virus makes 
it quite clear that the program has 
been infected. Virdem overwrites 
the first part of the program and 
appends the original code to the end 
of the file. At least two variants of 
this virus are known - one with all 
the text strings in German, but the 
other is an English language ver¬ 
sion. In addition, a modified 792 
byte variant is also known, where 
all the text messages have been 
removed. 

• Virus-90 — The most interesting 
fact regarding this virus is that the 
author of it is known. He uploaded 
the virus to a number of BBS, 
saying that the source code was 
available for around $20. The virus 
is a simple .COM infcctor that adds 
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857 bytes to any file it infects. It 
will only infect files on drive A: and 
B: but it would be easy to “fix” that. 
An infected program will display 
the message “Infected!” when it is 
executed, but otherwise the virus 
docs nothing at all. The virus was 
uploaded for educational purposes, 
according to the author, but has now 
been removed. 

A “new and improved” version, 
Virus-101 also exists. It infects 
.EXE files as well as .COM files, 
and is somewhat variable. Some at¬ 
tempts were made to make it dif¬ 
ficult to disassemble or modify the 
virus, but this “protection” is easily 
defeated. The virus contains one 
unusual feature, it infects COM- 
MAND.COM by overwriting it, in 
the same manner as the Lehigh virus 
does, so no change in length is 
visible. Unlike Virus-90, this virus 
was not made publically available. 

• Voronezh — This is a 1600 byte 
virus from Eastern Europe, which 
infects COM and EXE files. It over¬ 
writes the beginning of COM files, 
placing the original code in 
encrypted form at the end. EXE 
files arc also infected in an unusual 
way - the original CS:PC is not 
changed, but the first 5 bytes of the 
program code are overwritten with a 
FAR CALL to the virus code. A 
related virus, 600 bytes long is also 
known, but it is only able to infect 
COM files. 

• VP — First reported in April ’90, 
this virus is of the direct-action 
.COM infecting kind. It contains 
one unusual feature - at the begin¬ 
ning of the virus a variable number 
of NOP instructions (0-15 in num¬ 
ber) are added. This is probably 
done in order to confuse “on-the- 
fly” virus scanners. The virus then 
appends 909 bytes containing the 
virus code. 

• W13 — This is a rather primitive 
.COM infecting virus. Two variants 
are known, the first one is 534 bytes 
long, but the second, with some bugs 


corrected, is only 507 bytes long. 
The variants arc both of the “Direct 
Action” type and do nothing inter¬ 
esting. They are based on the Vienna 
virus, but mark infected files by set¬ 
ting the “month" field to 13, instead 
of setting the "seconds” field to 62. 
This virus originated in the Soviet 
Union. 

• Whale — This is a recent, rather 
remarkable virus. It is long, 9216 
bytes and able to infect COM and 
EXE files. The increase in file size 
is not visible though, while the virus 
is active in memory, as it uses 
several advanced “stealth” methods. 
Other effects of the virus are not 
known, but one infected program 
displayed the following message 
when run: 

THE WHALE IN SEARCH OF THE 8 
FISH I AM ’-knzyvo) 1 IN 
HAMBURG addr error D9EB.02 

Most of the virus is devoted to 
encryption and code which moves 
blocks of virus code around. This 
overhead results in a considerable 
slowdown of infected systems. 

• Wisconsin — Like the AntiPascal 
viruses, this one is targeted against 
Pascal programs. It even includes 
the string: 

Death to Pascal — in encrypted form. 

The virus is 825 byte long and only 
infects .COM files. 

• XA1 — The XA1 virus overwrites 
the first 1539 bytes of infected 
.COM files with itself and stores the 
original code at the end of the file. 
On April 1st, a part of the virus will 
activate - overwriting the boot sec¬ 
tor with code that will cause the 
computer to “hang” on next boot-up. 
The virus will also activate on 
December 21st and stay active until 
the end of the year. It will then dis¬ 
play a Christmas tree, and the text 

Und er lebt doch noch: Der 
Tannenbauml Frohe Weihnachten 

• Zero Bug — The “Zero Bug” wili 
mark infected files in the same way 
as the Vienna virus, placing 62 in the 


“seconds” field of the timestamp of 
the .COM files it infects. Apart 
from this, the viruses are very dis¬ 
similar. 

This virus will search for COM- 
MAND.COM, using the value of 
the COMSPEC environment vari¬ 
able to locate the file. Then it will 
remain resident, hook INT 60 and 
infect every .COM file run. After 
some time has passed, a “smiley” 
(ASCII 1) will appear on the screen 
and “eat” any zero it finds on the 
screen. 

The virus seems not too well written 

- containing some unreachable 
code, but it is unusual in some ways. 

When it infects a file, it will add 
1536 bytes in front of the original 
code, just like the Agiplan virus. It 
also contains one “feature” that will 
probably be more used in the future 

- if the vims is active in memory and 
you look at a directory containing 
infected files, the virus will make 
the directory entries appear as they 
were before the infection. That is, 
you will not see any increase in file 
length. TTiis method is also used by 
some of the latest viruses from Bul¬ 
garia. 

• Zero Hunt (Minnow)—This virus 
infects programs by overwriting 
them, but unlike most overwriting 
viruses it will generally not destroy 
the host program. This is because it 
will only overwrite unused areas 
within the program. The vims sear¬ 
ches for a block of at least 416 zero 
bytes, and places itself there. As 
such programs are rare, this reduces 
the chances of the vims spread¬ 
ing.! 


Benchmarks 


April 1991 


Page 31 



VAX/UNIX SYSTEMS 


VAXCLUSTER USAGE STATISTICS 


February Top Ten Programs: CPU Time Used 


Program 

Description 

CPU Time 

Percent of ToUl 

t. User programs 

Compiled Pro puna 

13 10:53:46.36 


63.1 

2. MAXCLAS 

CIS Image V*x*.ci*irg 

t 07:52:26.28 


6.2 

3. NEWS 

ANIJ New* Utility 

1 05:20:23.78 


5.7 

4. D1SKEHPER 

DUk Optim uer 

021:25:59.05 


4.2 

S. XY//Y 

Chu Utility 

009:46:19.34 


1.9 

6 . BACKUP 

Diak Baekupa 

008:55:37.11 


1.7 

7 NNTP TCPWIN 

New. Tiunfci Utility 

008:43:53.46 


1.7 

8 . CLUSTR 

OIS Image Pnee«in* 

008:09:19.13 


1.6 

9. MAH. 

VMS Mail 

007:19:36.41 


1.4 

10. EOT 

Editor 

007:13:5292 


1.4 

ToUl 


21 07:22:45.53 




February Top Ten Programs: Frequency of Runs 

Program 

Description 

Number of Runs 

Percent of Tout 

1. LOGINOUT 

Uk» Injin 

88011 

19.8 

2. SET 

VMS Utility 

55817 

116 

3. DIRECTORY 

VMS Utility 

40042 

9.0 

4. User programs 

Compiled Program* 

34066 

7.7 

5. DELETE 

VMS Utility 

32855 

7.4 

6 . SEND 

BrrNRT meauje Utility 

24553 

5.5 

7. SYSI.OCilN 

U«ef l.ogei 

17259 

3.9 

8 . MAIL.SERVER 

VMS Mail Server 

16727 

3.8 

9. EDI 

Editor 

15296 

3.4 

10. MAIL 

VMS Mail Utility 

13091 

2.9 

Total 


444252 



Compressing Your VMS Mail File 

Taken from an article by Tony McCracken that appeared in the January!February 
1991 NewsBytes, the newsletter of Northern Arizona University Computer Services. 

F requent users of the VAX MAIL utility can save a considerable amount of disk 
space by compressing their mail file. As old mail messages are deleted and 
new messages arrive, the file MAIL.MAI becomes fragmented. Compressing the 
mail file defragments the MAIL.MAI file and returns the reclaimed disk space to 
the user’s account. 

The people most frequently affected by mail file fragmentation are those who 
subscribe to listservers and/or receive and delete large quantities of mail. Users who 
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Edited by Ken Corey. 
VAX Programmer/ 
Operator (BrrKET: 

KWC@UNTVAX) 


w. 



r elcome to the Best of the 

BBC rnlnmn This r.nhimn 


highlights some of the more inter¬ 
esting and useful discussions on 
the UNT BBS. For those of you not 
familiar with the BBS, here is how 
to log into the UNT BBS. 

• Sign-on by typing CALL DEC at the LAN 
prompt and then entering BBS as your 
Username at the VAX prompt. 


• II you are already logged-on to the 
VAX cl us ter, type BBS at the $ prompt. 


The opinions expressed in this column 
do not necessarily reflect the views of 
Academic Computing Services or the 
Computing Center. Also, information 
in Beet of the BBS has not been 
checked for accuracy. 


FILELIBRARY 


#1565 5-FEB-1991 10:52:46.92 
Subject: MS-Kermit 3.10 
Hi, 

I've uploaded the latest version (3.10) 
of MS-Kermit in the IBM.COMM 
area. The file name is 
KERM310.ARC. Included in this ar¬ 
chive is a list of changes in the new 
version. 

Billy 


#1602 12-FEB-1991 14:15:28.07 
Subject: New FPROT-114 Signature 
File 

I’ve uploaded the latest FPROT-114 
signature file into 1BM.IMMUNE. It 
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is called F114SIG1.TXT. When you 
download it, rename it to SIGN.TXT 
(the old version is not needed any 
longer). The new file allows 18 more 
viruses to be detected. 

Billy 


UTILITIES 


#1795 5-MAR-1991 09:41:19.25 
Subject: 4 DOS 

I uploaded a COMMAND.COM re¬ 
placement called 4DOS into the 
IBM.UTILITY area. 

It replaces several utilities at once in¬ 
cluding CED and some of the directory 
utilities. It also fixes several irritating 
features of DOS like it supports 
wildcards on the TYPE command. 
Billy 


Compress continued fro m page 32 


#1800 Reply to #1795 6-MAR-1991 
21:10:11.07 

Subject: RE: 4DOS 

It also allows larger programs to be run, 
by swapping out most of its code to 
disk or high memory when not being 
used. 

It also allows wildcards such as 
*K*.DAT to work correctly. 

It also has built in help for all MSDOS 
commands. 

It also has a capability to GREATLY 
enhance batch files, both in speed and 
in color, with boxes, and all that. 

—Another satisfied customer 


TRUE TYPE FONTS 


#1917 17-MAR-1991 21:19:22.15 
Subject: TrueType Fonts availability. 


‘alo! 

TrueType Fonts are now available 
from apple.com. Those that have FTP 
capable can anonymous login to 
apple.com and the files are in 
/pub/dts/sw.liccnse directory. 

troelype-fonU-1 -O.hqx 665,868 bytei 
m»c-printing-iool«.hqx 827,976 byte* 
pereonal-lw-ls-iniull.hqx 468,007 byte* 
Jlylewriler-iniUll.hqx 405,537 by lei 

You will need the first two files and one 
of the last two. It requires System 6.0.7 
so if you don’t have one, you can get 
them in the same directory (at least 4 of 
them: system, print, util 1, and util 2 
and you also need diskeopy 4.1 —also 
in there). 

VAXEN bewam to have disk quota 
enough before ‘get’ them (hint: ‘show 
quota’ then multiply free spaces with 
512). Sequent’s users: well the /u disk 
was at 95% last Friday. 

On campus people can also get them 
from me. Best time for me is after work, 
i.c after 5 pm. Bring at least 3 disks for 
TrueType stuff and 4 more if you also 
want 6.0.7. Ask for ‘mouse’ at Mat¬ 
thews 322B. 

Telephone people, sorry. I can’t 
upload them to the BBS. It requires 
license, you know. When you copy 
System software from me, it is assumes 
(by Apple) that you except the license 
but still does not give me right to 

upload. Go bug ProSystcm or apple 

dealers around here. 

And yes, it does co-exist with ATM. 
There are only 4-5 (system) fonts in this 
release. As a freebie, it comes with 
Font/DA mover 4.1! (Whatever hap¬ 
pens to 3.9 and 4.0?) 

_Mouse.oO~ x-4435 COE:mouse 
mouse<g) vaxb.acs.unt.edu 

x-4437 COE2:mouse 
mousetSmou se.coe.unt.edu ■ 


have had their accounts for a long time but have never compressed their mail will 
also receive a substantial benefit. 

Compressing your mail not only saves disk space, it also improves the overall 
performance of the mail utility by speeding up access to the MAIL.MAI file. An 
unfragmcnlcd mail file is processed faster by the MAIL utility. 

Compressing your mail takes five steps (these examples assume you don’t have a 
mail subdirectory): 

O Enter the VAX MAIIL Utility. 

S mail 

0 Enter the compress command at the MAIL> prompt. 

MAIL> compress 
® Exit the MAIL utility. 

MAIL> exit 

© Reset the file protection on the MAIL.OLD file to allow deletion. 

S set protection=o:rwed mail.old 

© Delete the file called MAIL.OLD. The MAIL.MAI file will be much smaller and 
easier to process. 

$ delete mail.old; 

If you have any questions regarding compressing the MAIL.MAI file, contact the 
VAX operators (Voice: 565-4161, VAX MAIL: operator). ■ 
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% The 
JuNIX 
/ Shell 


Ry Marc St.-Gil, UNIX Systems 
Programmer (mstgil@sol.acs.unt.cdu) 


H ello! And welcome back lo The 
UNIX Shell. This months 
column is a short note about using 
folders in the ‘elm' electronic mail 
package. ‘Elm' is available on sol and 
also on ponder. ‘Elm’ can be installed 
on almost any UNIX system. If you 
want to use ‘elm’ on your UNIX sys¬ 
tem, have your system administrator 
contact me for more information. 

Now then, the question has arisen, how 
do I access ‘folders' in ‘elm’? First of 
all you need a directory to keep your 
folders in. Your account’s home direc¬ 
tory should already have a subdirectory 
named ‘mail.’ This is where your 
folders should reside. ‘Elm’s concept 
of a folder is simply a file containing 
one dr more mail messages. ‘Elm’ 
doesn’t care where you keep a folder, 
but it docs have the idea of a default 
location. The default location for 
folders in ‘elm’ is the ‘mail’ directory 
in your home directory. ‘Elm’ oilers 
you a shortcut to specifying this loca¬ 
tion by beginning the folder name with 
an *=’ character. For example, the 
folder ‘=sent’ refers to a folder file 
named ‘SHOME/mail/sent’. ‘Elm’ 
also understands the idea of a current 
working directory. This means that the 
folder named ‘junk’ will be in the direc¬ 
tory you were currently working in 
when you invoked the ‘elm’ program. 
When you arc in ‘elm’, at the index or 
any of die read mail prompts, you may 
‘s’ave the current message to a 1 older. 


‘d’clctc the current message, or ‘Copy 
the current message to a folder. The 
‘s’ave command is invoked with the 
character‘s.’ This command will save 
your current mail message to the folder 
you specify and mark that message to 
be discarded from your current folder 
when you exit ’elm’ or change to a new 
folder. The ‘d’elcte command is in¬ 
voked with the character ‘d.’ This 
command marks a mail message to be 
discarded from your current folder 
when you exit ‘elm’ or change to a new 
folder. The ‘Copy command is in¬ 
voked with the character ‘C’ (note that 
this is a capital Q. This command is 
the similar to ‘s’avc except it does not 
mark the current message to be dis¬ 
carded. At the index menu, you may 
also issue the ‘c’hangc folder command 
with the character ‘c.’ This command 
will prompt you for a new folder to 
work in. By default, ‘elm’ always 
starts up in your incoming mail folder. 
This folder resides in a special directory 
with everyone else’s incoming mail 
folders. Your incoming mail folder 
will be called ‘/var/spool/mail/userid,’ 
where userid is your user ID (i.e. acO 1). 
NOTE: Any mail you leave in your 
incoming mail folder will show up in 
the unread message count displayed by 
the Finger command. This is just one 
more good reason not to leave mail 
there for any length of time. 

You may want to review some mes¬ 
sages that you have saved with ’elm’ in 
other folders. This is where the 
‘c’hangc folder command becomes 

useful. You may manipulate any folder 

with all the same commands that you 
use in your incoming mail folder. 

For more information on using the 
‘elm’ electronic mail package, use the 
‘man elm’ command. The Usenet 
news group comp.mail.elm also has in¬ 
teresting information. A handout on 
using ‘elm ’ will be available soon in the 
Computing Center (ISB 119). Be sure 
to check for it sometime in the next 
month. ■ 


UNIX News 


Documentation 

• The GNU Emacs Manual has been 
placed, for reference purposes, in 
the following locations: ISB 110, 
5th floor GAB Lab, College of 
Business Computer Center. 

• Vi Reference material is now avail¬ 
able in ISB 119 in the form of a 
handout. “An Introduction to Edit¬ 
ing With Vi,” is free for the asking. 

Software 

The following software has been in¬ 
stalled on the Solboumc and is avail¬ 
able for use: 


* FORTRAN77 
" Perl 

* TCP/IP 
“ E-mail 

* NFS 
Kermit 
Emacs 
csound 

Software planned to be installed in the 
near future: 

" SAS 

° Gaussian 90 
0 USENET Newsreader 
cmix 
0 C-H- 
0 TPU 


UNIX Users 
Group 

T he Mctroplex UNIX Users 
Group meets the first 
THURSDAY of everymonth at the 
Hewlett-Packard facility in Las 
Colinas. Contact Marc St.-Gil 
(565-2324, (mstgil@sol.acs.unt. 
cdu) for more information). 
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Mainframe Performance Statistics 


Operating Systems Performance Statistics for February 


CPU 

ACAD 

ACAD 

ACAD 

ACAD 


ADMN 

ADMN 

ADMN 


SYSTEM 
VM/XA2 
MUSIC/SP 
MVS/JHS2 
COM PI .ETA 


MVS/JES2 
COMPI. ETA 
ADABASA 


Planned 

Production 


Production 

Hours 

System 

Hours 

Achieved 

Uptime 

672.00 

668.74 

99.5% 

649.43 

645.30 

99.4% 

672.00 

665.96 

99.1% 

661.81 

655.43 

99.0% 


672.00 

261.00 

644.04 


669.25 

259.72 

639.16 


99.6% 

99.5% 

99.2% 


• The ACAD CPU achieved 100% uptime in February. 

• The HDS/7360 DASD achieved 100% uptime in February. 

• The I IDS/7380 DASD achieved 100% uptime in February. 

• The A DMN CPU achieved 100% uptime in February. 

• The HDS/7360 DASD achieved 100% uptime in February. 

• The 11DS/7380 DASD achieved 100% uptime in February. 

• The EMC Solid State Disk achieved 92.4% uptime in February. 


ACADemic (HDS) Program Hit Parade 


Key Causes Of Lost Produc- 
tlvlty In February: ACAD CPU 


Miscellaneous 


1. VM0CA systems soft¬ 
ware development. 

2. MVSSP systems soft¬ 
ware development 

3. Undetermined causes 
for systems restarts. 

TOTAL 


4.85 HOURS 

0.85 

0.73 

6.43 HOURS 


Key Causes Of Lost Produc- 
tlvlty In February; ADMN CPU 


Miscellaneous 


1. Systems software 
development. 

2. ADABASA system 
failure. 

3. COMPLETA system 
failure. 

TOTAL 


3.24 HOURS 
1.64 
0 23 

5.11 HOURS 


February Top Ten Programs : Frequency Of Runs 


Program 

Description 

If of 

Runs 

%of 

Total 

1. IEWL 

Linkage Editor 

13422 

14.4 

2. 1KBGENER 

IBM Utility 

13189 

14.2 

3. PGM=*.DD 

Compiled Program 

13023 

14.0 

4. ADARUN 

ADABAS Utility Module 

12158 

13.0 

5. SPC1II.COB 

COBOI.2 Report Writer 

6953 

7.5 

6 . IGYCRCTL 

VS (XJBOI.2 Compiler 

6559 

7.0 

7. ADASQI. 

ADABAS SQL precompiler 

6318 

6.8 

8 . SASI.PA 

SAS 

4211 

4.5 

9. 1DCAMS 

VSAM Utility 

2805 

3.0 

10. IEBPTPCH 

IBM list Utility 

2259 

2.4 


February Top Ten Programs: CPU Seconds Used 


Program 

Description 

CPU 

%of 



Seconds 

Total 

1. SAS370 

SAS Version 6.06 

130887 

403 

2. SASLPA 

SAS Version 5.18 

45155 

13.9 

3. SPCHLCOB 

COHOL2 Report Writer 

41629 

12.8 

4. PGM=*.DD 

Compiled Program 

32496 

10.0 

5. ADARUN 

ADABAS Utility Module 

11678 

3.6 

6 . SPSS 

SPSS Version 4.0 

11210 

3.5 

7. COMPI.ET4 

Academic COM-PLETE 

10930 

3.4 

8 . IKWL 

linkage Editor 

8233 

2.5 

9. IGYCRCTL 

VS COBOU Compiler 

8029 

2.5 

10. SSS4001 

Operations Automation 

3221 

2.5 
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SYSTEM 

Administrative MVS/SP 


Academic MVS/SP 


MUSIC/SP 


VM/XA 


VAXcluster 


Disk Backup Schedules 


BACKUP 

DESCRIPTION 

Daily 

Monday - Friday around 7 p.m. (after COM-PLETE is 
shut down) & on Saturday & Sunday if COM-PLETE has 
been up that day. 

Weekly 

Full pack dumps taken each Sunday morning. 

Monthly 

Full pack dumps taken on the first day of each month. 

Daily 

Monday - Sunday during the early hours of the morning. 

Weekly 

Full pack dumps taken each Sunday. 

Monthly 

Full volume dumps taken on the first day of each month. 

Daily 

Wednesday - Monday starting at 4 a.m. and lasting about 
30 minutes. 

Weekly 

Tuesday mornings at 3 a.m., these last about 2 hours. 

Semester 

Once a semester, a permanent backup is taken. 

VM Weekly 

Early every Wednesday morning. 

CMS mini-disks 

Daily backup performed early every morning. Weekly 
backup every Tuesday starting after Midnight. 

Semester 

Once a semester, a permanent backup is taken. 

Daily 

Incremental backups arc performed 

Monday -Thursday at 6 p.m. Saturday & Sunday at 5 p.m 

Weekly 

Full backups are performed every Friday beginning at 8 
a.m. generally last all day. 

Monthly 

A “stand alone” backup is performed monthly. Dates and 
times arc given in the system log-on message. 

Semester 

Once a semester, a permanent backup is taken. 


A full description of the system backup procedures can be found by typing HELP BACKUP on MUSIC or the VAXclusicr. 
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